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The Effects of Accelerated Revenue 
Recognition on Earnings Management 
and Earnings Informativeness: 
Evidence from SEC Staff Accounting 
Bulletin No. 101 


Jennifer Altamuro 
Massachusetts Institute of Technology 


. Anne L. Beatty | 
The Ohio State University 


Joseph Weber 
Massachusetts Institute of Technology 


ABSTRACT: The SEC issued Staff Accounting Bulletin (SAB) No. 101 to address its 
concern that firms were masking true performance by managing earnings using ac- 
celerated revenue recognition. Critics of thls Accounting Bulletin stated that it would 
eliminate industry-accepted revenue recognition practices and reduce the quality of 
reported earnings. The FASB's revenue recognition discussions echo these concems 
stating that revenues recorded prior to the completion of the earnings process contain 
value-relevant information about future performance. This paper Investigates these two 
hypotheses using a sample of firms that accelerated revenue recognition prior to SAB 
No. 101 adoption (SAB 101 firms) and a matched set of firms that were unaffected by 
this regulation (unaffected firms). Our earnings distribution tests Indicate that SAB 101 
firms are more likely to meet earnings benchmarks. Specifically, we find that, in the 
pre-adoption period, SAB 101 firms report fewer small negative and more small positive 
earnings than they do in the post-adoption period and than do unaffected firms in the 
pre-adoptlon period; SAB 101 firms report fewer small negative and more small positive 
earnings changes in the pre-adoption period compared to the post-adoption period. 
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We also document that SAB 101 firms are more ilkely to have weaker corporate gov- 
ernance and more likely to have financial covenants, providing them with greater in- 
centives to manage earnings. However, we find that the association between earnings 
and future cash flows and between unexpected earnings and earnings announcement 
period returns were higher for SAB 101 firms than for unaffected firms in the pre- 
adoption period, indicating higher earnings Informativeness for SAB 101 firms. These 
associations declined for SAB 101 firms in the post-adoption period, suggesting that 
SAB No. 101 caused a decline in earnings Informativeness. Overall, our results suggest 
that, although the revenue recognition practices targeted by SAB No. 101 have been 
used by some firms to manage earnings, the regulation's prohibition of revenue rec- 
ognition prior to completion of the eamings process, on average, results in less infor- 
mative earnings since these uneamed revenues provide value-relevant informatlon. 


Keywords: revenue recognition; earnings management; benchmark-beating; signaling; 
cash flows. 


Data Availability: Data are available from public sources identified in the paper. 


I. INTRODUCTION 

he Securities and Exchange Commission (SEC) has argued that improper revenue 

recognition is the single largest issue contributing to financial restatements.’ In his 

famous 1998 speech, "The Numbers Game," Arthur Levitt, then chairman of the 
SEC, indicated that the use of premature revenue recognition practices to meet Wall Street 
expectations is one of the five fundamental problems with current financial reporting.? 
Ultimately, the SEC's belief that premature revenue recognition reduces the quality of 
reported earnings and the integrity of financial reporting caused the SEC to issue Staff 
Accounting Bulletin (SAB) No. 101, Revenue Recognition (SEC 1999).? 

The SEC states that SAB No. 101 provides guidance on how to interpret and apply 
existing revenue recognition rules. Critics of SAB No. 101 argue that broad application of 
those criteria is not always appropriate. For example, Philip Ameen, chair of the Financial 
Executives Institute Committee on Corporate Reporting, argues that SAB No. 101 changes 
Generally Accepted Accounting Principles (GAAP), overturning standard industry prac- 
tices.* PricewaterhouseCoopers (2001) further argues that the change in accounting required 
by SAB No. 101 will result in GAAP earnings that have less predictive ability for future 
cash flows and thus less relevance in valuing companies. 

The Financial Accounting Standards Board (FASB) staff appears sympathetic to the 
criticisms of SAB No. 101, stating that the regulation goes beyond existing GAAP by 
making the revenue recognition criteria developed for the software industry the foundation 
for all basic revenue recognition principles. The FASB also argues that requiring comple- 
tion of the earnings process for revenue recognition is inconsistent with the recognition 
criteria for other financial statement components, even though revenues arise from changes 
in these other components. Based on these concerns and inconsistencies in the Conceptual 


! Turner et al. (2001) identify 381 restatements that occurred during 1997—1999. They find that revenue recognition 
restatements are the most frequent and result in the largest market reaction. 

?^ Levitt's speech, made at the NYU Center for Law and Business on September 28, 1998, can be found at http: 
I l'www.sec.gov / news/speech/speecharchive/ 1998 / spch220.txt. 

* See Osterland (2000). 

4 See Ameen (2000) and Osterland (2000) for examples of industry concerns regarding SAB No. 101 adoption. 

5 These arguments can be found in "Project Updates—Revenue Recognition" dated July 14, 2004 at the FASB's 
website: http: // www.fasb.org /project/revenue. recognition.shtmiffhistory. 
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 Framework's treatment of revenue recognition, the FASB has undertaken a project to de- 

_velop a comprehensive Statement of Financial Accounting Standards on revenue recogni- 
tion. This standard would go beyond reinstating the revenue recognition practices eliminated 
by SAB No. 101 by eliminating completeness of the earnings process as a condition for 
revenue recognition. 

In this paper, we use the reporting requirements imposed by SAB No. 101 to examine 

how accounting methods that accelerate revenue recognition affect financial reporting. First, 
we examine the SEC's concern that allowing revenue recognition prior to completion of 
the earnings process results in increased earnings management. Second, we examine the 
possibility, currently being considered by the FASB, that including unearned revenue in 
earnings provides value-relevant information about future performance. 
. To investigate these two hypotheses, we identify a sample of firms that recorded cu- 
mulative effects adjustments required by SAB No. 101 adoption (SAB 101 firms) and, 
where available, we identify the amount of accelerated revenues using the pro forma dis- 
closures. We also identify two sets of control firms whose earnings are unlikely to include 
accelerated revenue. First, we use the firm as its own control by focusing on the earnings 
of the SAB 101 firms in the post-adoption period. Second, we use firms that did not report 
an SAB No. 101 cumulative effects adjustment (unaffected firms) but are in the same 
industry, have similar asset sizes, and engage in the same types of revenue-generating 
activities as the SAB 101 firms. 

We test our earnings management hypothesis by comparing earnings distributions of 
SAB 101 firms to earnings distributions of the two sets of control firms to determine 
whether SAB 101 firms are more likely to beat benchmarks. We also compare character- 
istics of SAB No. 101 and unaffected firms in the pre-adoption period to determine whether 
SAB No. 101 firms have stronger incentives to manage earnings. 

The results of these tests are consistent with the earnings management hypothesis. 
Specifically, we find that, in the pre-adoption period, SAB 101 firms report fewer small 
negative and more small positive earnings than they did in the post-adoption period and 
than did unaffected firms in the pre-adoption period. These findings suggest that some SAB 
101 firms have been accelerating revenue recognition to avoid reporting losses. We also 
find limited evidence that firms recognize revenue early to avoid reductions in income. In 
particular, we find that SAB 101 firms report fewer small negative and more small positive 
earnings changes in the pre-adoption period compared to the post-adoption period, but these 
differences are not significantly different from those for unaffected firms. Our logistic anal- 
ysis examining the incentives to manage earnings indicates that SAB 101 firms are more 
likely to have weak corporate governance and debt with financial covenants. However, we 
find no evidence that SAB 101 firms are more likely to have recently issued new financing 
or to have earnings-based bonus plans. 

We test the hypothesis that revenues recorded in income prior to the completion of the 
earnings process are more informative by examining differences in the association between 
earnings and future cash flows and between unexpected earnings and earnings announce- 
ment period returns for our test and control firms. We also use the pro forma data reported 
by SAB 101 firms to conduct similar tests on the informativeness of accelerated earnings. 

The results of these tests are consistent with the earnings informativeness hypothesis. 
For the SAB 101 firms, the associations between future cash flows, returns, and earnings 
are higher in the pre-adoption period than in the post-adoption period. When compared to 
the associations for unaffected firms, the associations for SAB 101 firms are higher in the 
pre-period and lower in the post-period. We also find that when we use the pro forma 
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information to decompose reported earnings into accelerated and restated earnings, both 
components predict future cash flows and are associated with contemporaneous annual 
returns. These findings suggest that, despite our evidence that the revenue recognition prac- 
tices eliminated by SAB No. 101 were used to manage earnings, on average, these practices 
resulted in earnings that were more informative. 

We link our two sets of results in a supplemental analysis. We partition SAB 101 firms 
into groups based on their earnings management incentives and examine whether the results 
of the earnings informativeness tests differs across these groups. We find that, on average, 
earnings are more highly associated with future cash flows when they contain accelerated 
revenue recognition regardless of the earnings management incentives. We find mixed re- 
sults for a similar returns test. Overall, these tests suggest that, on average, earnings that 
include accelerated revenue are more informative even for groups of firms with the highest 
incentives to manage earnings. 

In summary, the SEC's adoption of SAB No. 101 provides an opportunity to examine 
the trade-off between discretion in reporting revenues prior to the completion of the earnings 
process and the relevance of unearned revenues. Although we find some evidence of in- 
creased earnings management by firms that accelerate revenue recognition, we find that on 
average, the relevance of reported earnings for these firms is greater than when the revenue 
recognition process is delayed. These results are potentially interesting to the regulators 
and standard setters engaged in developing and implementing revenue recognition standards 
who must consider the trade-offs between discretion in reporting revenues prior to the 
completion of the earnings process and the informativeness of unearned revenues. 

Section II provides background information for our study. We develop our hypotheses 
in Section HI and discuss our sample selection in Section IV. We provide a description of 
our research design in Section V. We discuss our empirical results in Section VI and 
conclude in Section VII. 


IL BACKGROUND 

In September 1998, Arthur Levitt, then chairman of the SEC, delivered a speech that 
accused corporate executives of using "accounting hocus-pocus”? to “meet or beat Wall 
Street earnings projections." He argued that manipulation of revenue recognition is one of 
five popular earnings management tricks employed to “mask the true consequences of 
management's decisions" (Levitt 1998). In his speech, he challenged "corporate manage- 
ment and Wall Street to re-examine our current environment ... and to embrace nothing less 
than a cultural change" (Levitt 1998). 

In December 1999, the SEC responded to Levitt's challenge by issuing SAB No. 101. 
The accounting bulletin outlines the SEC's four criteria for revenue recognition: (1) per- 
suasive evidence of an arrangement exists, (2) delivery has occurred or services. have been 
rendered, (3) seller's price to the buyer is fixed or determinable, and (4) collectibility is 
reasonably assured. 

Due to industry concerns about the accounting bulletin, the SEC deferred the effective 
date of SAB No. 101 until the fourth quarter of 2000. In October 2000, the SEC provided 
a list of frequently asked questions that stipulated that, consistent with APB No. 20, the 
transition to SAB No. 101 should be accounted for using a cumulative effects adjustment 
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as long as the prior accounting did not reflect an error? In January 2001, the SEC denied 
all final requests to change or further delay SAB No. 101 implementation. 

Despite the SEC's claims that any group of moderately competent accountants will 
quickly arrive at the same revenue recognition criteria, in January 2002 the FASB decided 
to issue, for public comment, a revenue recognition project proposal. The FASB's proposal 
moves away from the SAB No. 101 earnings process completion requirement for revenue 
recognition and toward allowing revenue recognition when there is a verifiable measure of 
a change in the value of a firm's assets or liabilities. The FASB's stated reason for under- 
taking this project is the lack of a comprehensive standard on revenue recognition and the 
ad hoc nature of implementation guidance. In May 2002, the FASB added this project to 
its technical agenda and plans to issue a revenue recognition initial due process document 
in the second quarter of 2005. 


HL HYPOTHESIS DEVELOPMENT 

_ The differences between SAB No. 101 and the FASB’s revenue recognition proposal 
highlight the trade-offs in the timing of revenue recognition. The SEC argues that allowing 
firms to accelerate revenue recognition will allow for misleading earnings management. The 
FASB’s position is that accelerating the recognition of unearned revenues will provide 
investors with value-relevant information about future performance. This paper provides 
evidence on these two hypotheses, recognizing that they are not mutually exclusive. 

Accelerating revenue recognition could increase the informativeness of earnings in two 
ways. First, consistent with Subramanyan (1996), some firms choose to manage earnings 
to signal value-relevant information about future performance. This is contrary to the SEC’s 
views that earnings management is used to mask rather than signal true performance. Sec- 
ond, consistent with the FASB’s argument, accelerating revenue recognition increases the 
value relevance of firms’ earnings, on average, even if some firms use accelerated revenue 
recognition to engage in misleading earnings management. 


Earnings Management Hypothesis 

Given the SEC’s concern, we posit that SAB 101 firms exhibit more evidence of earn- 
ings management in the pre-adoption than in the post-adoption period. Burgstahler and 
Dichev (1997) argue that firms have incentives to just meet or beat earnings benchmarks. 
Graham et al. (2004) find in a survey of 401 financial executives that over 65 percent of 
those surveyed thought that having positive earnings is important, 90 percent of those from 
profitable firms thought meeting earnings change benchmarks is important, and over 40 
percent would manipulate revenue recognition polices to meet these benchmarks. If SAB 
101 firms were to use the revenue recognition practices targeted by SAB No. 101 to manage 
earnings, then we would expect these firms to report fewer small negative and more small 
positive earnings or earnings changes than our control groups. 

We also expect that SAB 101 firms have greater incentives to manage earnings than 
unaffected firms. We consider four earnings management incentives identified by prior 
research. First, we expect that firms with weak corporate governance are more likely to 
engage in earnings management. Second, based on Watts and Zimmerman (1986, 1990) we 


$ APB No. 20 states that the amount of the cumulative effect adjustment shown in the income statement *'is the 
difference between (a) the amount of retained earnings at the beginning of the period of a change and (b) the 
amount of retained earnings that would have been reported at that date if the new accounting principle had been 
applied retroactively for all prior periods." APB No. 20 also stipulates that pro forma effects of retroactive 
application should be shown on the face of the income statement or disclosed prominently in the notes to the 
financial statements, 
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examine the incentive to avoid violating financial covenants and thus reduce the costs of 
existing debt financing. Third, consistent with Teoh et al. (1998a, 1998b), we examine the 
incentive to report higher earnings prior to a new capital offering to obtain a better price 
and increased proceeds from the offering. However, Increased SEC scrutiny of public filings 
for new issuances makes aggressive revenue recognition choices less appealing. The final 
incentive we consider is CEO bonus plans. 


Earnings Informativeness Hypothesis 

In its current revenue recognition project deliberations, the FASB argues that revenue 
recognition based on completion of the earnings process is inconsistent with the recognition 
criteria for other financial statement components, even though revenues arise from changes 
in these other components. Specifically, it states that the unearned revenues that result from 
this requirement do not meet the definition of a liability, as defined in Concepts Statement 
No. 6 (FASB 1985). The FASB has argued that fair values are the most relevant measure 
of the value of assets and liabilities and Gi in these fair values should be the basis 
for revenue recognition. 

This argument is consistent with the views of SAB No. 101 opponents, who state that 
the bulletin eliminates accepted industry revenue recognition practices that do not reflect 
an attempt to artificially boost earnings. They contend that the accounting information 
generated prior to the adoption of SAB No. 101 was more value-relevant because earnings 
that include accelerated revenue recognition are more useful in predicting future cash flows. 
For example, if shipments requiring customers’ acceptances are not artificially accelerated 
and instead reflect customer demand, then the amounts associated with these shipments at 
period end provide an indication of future sales growth.” 

Both of these arguments suggest that adoption of this Staff Accounting Bulletin affects 
the association between earnings and future cash flows and between unexpected earnings 
and earnings announcement period returns for SAB 101 firms. H the SEC is correct, then 
these associations should increase for SAB 101 firms in the post-adoption period, when 
firms will no longer be able to accelerate revenue recognition to “mask” declines in per- 
formance. If the SAB No. 101 critics are correct, then these associations should decline for 
SAB 101 firms in the post-adoption period, when firms will no longer be able to use the 
revenue recognition principles that best reflect the nature of their business. 


IV. SAMPLE SELECTION 

We use the reporting requirements associated with the enactment of SAB No. 101 to 
identify a set of firms that elected to accelerate revenue recognition. More specifically, we 
conduct a keyword search of the Lexis/Nexis SEC filing database, identifying all firms that 
report a cumulative effect adjustment to earnings resulting from SAB No. 101 adoption. 
According to the revenue recognition policies proscribed by SAB No. 101, each of these 
firms accelerated revenue recognition prior to the adoption of this standard. This search 
yields a sample of 257 firms that adjust earnings. Compustat data was unavailable for 28 
of these firms, resulting in a final sample of 229 SAB 101 firms. The two most common 
types of accelerated revenue recognition polices in our sample relate to upfront fees under 


? See Rajgopal et al. (2003) for a similar EUN regarding the value relevance of order backlog. The authors 
find that backlog amounts reported in the financial statements do provide information about future earnings. 
However, their results indicate that the stock market behaves as if backlog contributes more to future earnings 
than that implied by the cross-sectional association between backlog and future earnings. 
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license agreements and shipment of goods requiring customer acceptance. The Appendix 
provides examples of the financial statement disclosures associated with these types of 
revenue polices before and after SAB No. 101 adoption. 

Only 110 of our SAB 101 firms (48 percent) reported pro forma information related 
to the adjustment, even though APB No. 20 requires the disclosure of this information. If 
available, the pro forma data provides a measure of as-if earnings for our SAB 101 firms. 
Given the reduction in our sample that results from using only this data and the potential 
for selection bias, we identify two. sets of control firms whose earnings are unlikely to be 
affected by accelerated revenue recognition as an alternative method of measuring as-if 
earnings. 

Our first set of control firms is our SAB 101 firms in the post-adoption period. We 
argue that in the pre-period the firms affected by SAB No. 101 were accelerating revenue 
recognition, but that in the post-period they were not. The advantage of using a firm as its 
own control is that the test sample and control sample are likely to be similar along all 
dimensions other than the revenue recognition policy followed. The disadvantage is that 
tests spanning the economic downturn in 2000, the year firms adopted SAB No. 101, could 
potentially bias our tests toward finding a change in the propensity to meet benchmarks 
and a change in the value relevance of earnings. 

Our second control sample consists of firms unaffected by SAB No. 101, but otherwise 
similar to tbe SAB 101 firms. To identify these firms, we first match our SAB 101 firms 
to other Compustat firms based on their two-digit SIC code and asset size. We then verify 
that SAB 101 firms and unaffected firms engage in the same types of revenue-generating 
transactions. Specifically, we examine whether our unaffected firms report engaging in 
economic activity that generates upfront fees and customer acceptances, which are the two 
most frequent types of activities affected by this accounting change.? We determine this by 
performing a keyword search on the financial statements of our unaffected firms. Despite 
substantial differences in how firms discuss their operations, we find a keyword match for 
89 percent of the sample and identify 226 unaffected control firms. 

By employing these two control samples, we not only compare our SAB 101 firms to 
unaffected firms in the pre-adoption period, but we also incorporate difference-in- 
differences tests that compare changes in test firms relative to changes in control firms. 
This differences-in-differences research design helps to mitigate the potential impact of the 
economic downturn on our tests.? 


V. RESEARCH DESIGN 
Earnings Management Tests 


To provide evidence on our earnings management hypothesis, we examine the earnings 
distributions of SAB 101 firms and unaffected firms to test for differences in the propensity 
to meet benchmarks. We use Degeorge et al.'s (1999) hierarchy of benchmark importance. 
We first examine the likelihood of meeting a positive earnings benchmark. Conditional on 


* Of the 229 sample firms, 50 firms adjusted their financial statements for other reasons. For these firms we 
examined whetber the adjusting and unaffected firms were in the same three- or four-digit SIC code. We find 
that 37 percent are in the same three-digit SIC code and 11 percent are in the same four-digit SIC code. 

? The effectiveness of the design in reducing this problem depends on the extent to which the economic downturn 
affected adjusting and unaffected firms similarly. To determine the relative impact, we examine the effect of the 
downturn on the earnings distribution of these two samples. We find that the economic downturn had a similar 
impact on the Ist quartile, median, and 3rd quartile firms in the adjusting and unaffected samples. ` 
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the firms having positive earnings, we examine the likelihood of meeting an earnings change 
benchmark.!? 

These benchmark tests use quarterly data. We include all quarters from fiscal 1997— 
1999 in our pre-adoption period. We exclude the four quarters of fiscal 2000 from our 
comparisons because it is unclear whether they reflect the implementation of SAB No. 101. 
The post-adoption period includes the eight fiscal quarters in 2001 and 2002, and any fiscal 
quarters with data available in fiscal 2003. 

Our earnings-level tests use quarterly net income (Compustat data item 69) scaled by 
end-of-period total assets (Compustat data item 44).!! Our earnings change analysis uses 
the seasonally adjusted difference in quarterly net income scaled by end-of-period total 
assets.? In both our earnings levels and earnings changes analyses, we define the distri- 
bution bin width to be 0.75 percent. We classify firms in the first bin to the right of zero 
as just meeting the benchmark, while those in the first bin to the left of zero are classified 
as just missing the benchmark.'? 

For the SAB 101 and unaffected firm samples, we create pre-adoption period and post- 
adoption period earnings histograms. Thus, we create four histoprams of quarterly earnings 
and four histograms for changes in quarterly earnings. Our tests require that we develop a 
model of the number of firms expected to fall in each bin in the absence of earnings 
management. For each histogram, we calculate this expectation using the Burgstahler and 
Dichev (1997) method of averaging the number of observations in the bins to the right and 
to the left of the bin of interest. We then create a variable that measures the difference 
between the actual number of firms and the expected number of firms for each bin in each 
earnings and earnings change histogram (Earn. Diff, Earnchng Diff, respectively). To mea- 
sure the extent of earnings management to meet benchmarks we create an indicator variable, 
Netbin, that is 1 for the bin just to the right of a particular benchmark (i.e., zero earnings 
or same quarter last year's earnings), —1 for the bin just to the left of the benchmark, and 
0 otherwise. 

We estimate the following regressions for the SAB 101 firms, the unaffected firms, and 
the pooled sample to test whether firms that accelerated revenue recognition were more 
likely to meet earnings thresholds: 


1? We do not examine the analyst forecast benchmark, the third component of Degeorge et al.’s (1999) hierarchy, 
because we do not feel that measurement of a firm's ability to meet this benchmark is applicable to our setting. 
In the pre-period, the ongoing communication between managers and analysts makes it difficult to form infer- 
ences about what information analvsts had about accelerated revenue and the ability of firms to use revenue 
recognition policies to meet the analyst forecast benchmarks. Additionally, the adoption period for Reg FD and 
SAB No. 101 overlap. Evidence suggests Reg FD has affected manngers' ability to guide analysts' forecasts, 
and thus the firms' propensity to meet analyst benchmarks. 

H We focus on net income rather than other earnings measures such as.pre-tax income based on Burgstahler's 
(2004) discussion of Beaver et al. (2003). Specifically, he comments that bottom line earnings are likely to be 
the focus of manager's benchmark-beating behavior. The most appropriate scalar was not obvious to us, so we 
rerun all of our tests scaling by the end-of-period shares outstanding (quarterly Compustat data item 61) and 
by market value of equity (quarterly Compustat data item 61 multiplied by quarterly Compustat data item 14) 
and find qualitatively similar results. 

12 Graham et al.’s (2004) survey results suggest that CEOs focus more on the seasonally adjusted change in net 
income than on the change in net income in adjacent quarters. We find similar results using this alternative 
earnings change metric. 

!* Since bin width is an arbitrary research design choice, we investigate the sensitivity of our results to this choice. 
Using bin widths of 0.5 percent to 1 percent, we find qualitatively similar results. 
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Earn. Diff, = a + B,Post + B,Netbing , + g4Post*NetBing,, + € (1) 
Earnchng. Diff, = a + B Post + B.Netbing gt B,Post*NetBin,, p + € (2) 
where: 


Earn_Diffe = difference between the expected number of firms and the actual num- 
ber of firms for each bin b in each period t for the histogram of net 
income (quarterly Compustat data item 69) scaled by end-of-period 
assets (quarterly Compustat data item 44); the bin width is 0.75 
percent; the expected number of firms for each bin b in each period 
t is estimated using the average of the number of observations in the 
adjacent bins to the left and to the right of the bin of interest; 

Earnchng _ Diff, „ = difference between the expected number of firms and the actual num- 

| ber of firms for each bin b in each period t for the histogram of the 
seasonally adjusted change in net income (quarterly Compustat data 
item 69) scaled by the end-of-period assets (quarterly Compustat data 
item 44); the bin width is 0.75 percent; the expected number of firms 
for each bin b in each period f is estimated using the average of the 
number of observations in the adjacent bins to the left and to the 
right of the bin of interest. 

Netbing , = indicator variable equal to 1 if the firm falls in the histogram bin 
just to the right of the benchmark, —1 for the histogram bin just to 
the left of the benchmark, and 0 otherwise; 

Post — indicator variable equal to 1 if the observation is drawn from the 
post-adoption period earnings distribution, and 0 otherwise; and 
Post*Netbing , = calculated by multiplying Post by Netbin. 


We conduct t-tests on the coefficients on Post*Netbin to determine whether the pro- 
pensity to meet thresholds was affected by SAB No. 101 adoption. Using an interactive 
dummy variable, we conduct F-tests on the differences between the coefficients for SAB 
101 and unaffected firms on Netbin to test for significant differences between these two 
samples in the propensity to meet benchmarks in the post-adoption period. We also conduct 
F-tests on Post*Netbin to test for significant differences in the changes in the propensity to 
meet benchmarks for the pre- versus post-adoption period across these two samples. 

We further test our earnings management hypothesis by examining firms' earnings 
management incentives. Specifically, we compare the characteristics of the SAB 101 firms 
to the unaffected firms during the pre-adoption period. 

To proxy for the quality of a firm's corporate governance system, we focus on the 
number of shares held by outside members of the board of directors as reported in the 
firm's fiscal vear 1999 proxy statement. Consistent with Bushman et al. (2002), we classify 
a member of the board as an outsider if he or she is not a member of the management 
team, a retired member of the management team, or a family member of the manage- 
ment team of the firm. We then create a variable, Corp Gov, which is the sample rank of 
the percentage of the shares owned by the board of directors that is owned by outside 
members. 

We focus on the firm’s use of bank debt financing to proxy for the debt-contracting 
incentives to manage earnings. Begley and Freedman (2004) document that in the period 
1999-2000, public debentures do not include financial covenants. Since bank debt virtually 
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always includes financial covenants, we argue that firms with bank debt are more likely to 
be affected by financial covenants. Accordingly, we create an indicator variable, Bank Debt, 
which is 1 if the firm has bank debt, and 0 otherwise. We examine the firms' debt footnotes 
and the material contracts section of their 1999 10-Ks to determine if they have outstanding 
bank debt. 

We use the Securities Datacorp Database (SDC) of debt and equity issuances to create 
three variables that proxy for the incentives to manage earnings prior to the issuance of 
debt or equity. We measure these variables during the three-year period prior to SAB No. 
101 adoption. The first, IPO, is an indicator variable equal to 1 if the firm undertook an 
IPO during the period, and 0 otherwise. The second, Equity Issue, is an indicator variable 
equal to 1 if the firm issued equity during the period, and 0 otherwise. The third, Debt 
Issue, is an indicator variable equal to 1 if the firm issued public debt during the period, 
and 0 otherwise. 

To measure the effect of CEO bonuses as an earnings management incentive, we ex- 
amine fiscal year 1999 proxy statements to determine whether firms' bonus plans are based 
on financial performance measures. Managers of firms with accounting-based bonus plans 
are expected to have stronger incentives to manage earnings upwards. Our proxy for this 
incentive, Bonus, is an indicator variable equal to 1 if the firm's bonus plan is based on 
financial performance, and 0 otherwise. Finally, we also include Size as a control variable, 
measured as the log of assets (annual Compustat data item 6) at the end of the pre-adoption 
period. Using these proxies we run the following logistic regression: 


SAB = a + B,Corp Gov + B Bank Debt + BPO + B,Equity Issue 
+ B.Debt Issue + B,Bonus + B,Size + € (3) 


where: 


SAB = 1 for SAB 101 firms, and 0 for unaffected firms; 
Corp Gov = sample rank of percentage of board of director ownership shares that are 
held by outside directors; 
Bank Debt = equals 1 if the firm has outstanding bank debt, and 0 otherwise; 
IPO = 1 if the firm undertook an initial public offering during the three-year period 
prior to SAB No. 101 adoption, and 0 otherwise; 
Equity Issue — 1 if the firm issued equity during the three-year period prior to SAB No. 
101 adoption, and 0 otherwise; 
Debt Issue = 1 if the firm's issued public debt during the three-year period prior to SAB 
No. 101 adoption, and 0 otherwise; 
Bonus — 1 if the firm offered a cash bonus based on financial performance in the 
fiscal year prior to SAB No. 101 adoption, and 0 otherwise; and 
Size — the log of assets (annual Compustat data item 6) at the end of the fiscal 
year prior to SAB No. 101 adoption. 


Earnings Informativeness Tests 

We use two measures of performance to test our earnings informativeness hypothesis. 
Our first construct is future cash flows. In the Statement of Concepts, the FASB argues that 
"financial reporting should provide information to help investors, creditors, and others 
assess the amounts, timing, and uncertainty of prospective net cash inflows to the related 
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enterprise.” !4 Based on the FASB's argument, we assert that earnings that are more highly 
associated with future cash flows are more value-relevant. Our second construct is the 
market's response to a firm's earnings announcement. We argue that if investors perceive 
earnings to be more informative, then there will be higher earnings response coefficients 
(ERCs).! Consistent with distinctions made in Lev (1989), we choose a short window 
during which to examine the association between earnings and returns to assure that the 
price change is mostly due to earnings information, rather than examining a longer window 
that might include other information and lead us to overstate the incremental information 
contribution of earnings to returns. 

We estimate the following regressions controlling for fixed firm and quarter effects, for 
the SAB 101 firms, the unaffected firms, and the pooled sample to test whether accelerated 
revenue recognition results in earnings that are more informative:!' 


LeadCFO¿  , = a + B,Post + B;Earn,, + B,Post*Earn; + € (4) 
Returns, = a + B,Post + B,AEarn,, + B,Post*AEarn, p + € (5) 
where: 


LeadCFO,,,, = accumulated cash flow from operations (quarterly Compustat data item 
108) scaled by end-of-period assets (quarterly Compustat data item 44) 
for firm i in one, two, three, and four subsequent quarters; 

Returns, = two-day cumulative market-adjusted returns from CRSP (using the value 
weighted market index) for firm 7 on the earnings report date t and date 
t-1; 
Post — indicator variable equal to 1 if the firm-quarter observation occurs during 
the post-adoption period, and 0 otherwise; 
Earn — earnings, measured as the net income (quarterly Compustat data item 69) 
for firm i in period q, scaled by end-of-period assets (quarterly Compustat 
data item 44); 
Post*Earn,,,, = Post multiplied by Earn; 
"n, = unexpected earnings, measured as the seasonally adjusted change in net 
income (quarterly Compustat data item 69) for firm i in period q, scaled 
by end-of-period shares outstanding (quarterly Compustat data item 61); 
and 
Post*AEarn = Post multiplied by AEarn. 


We conduct t-tests on the coefficients on Post* Earn and Post*AEarn to evaluate whether 
the change in revenue recognition practices resulting from SAB No. 101 adoption signifi- 
cantly affected earnings informativeness. We conduct F-tests on the differences between 


W See SFAC No. 1, FASB (1978, para. 37). 

13 See Dechow et al. (1998) and Barth et al. (2001) for similar arguments on why the association between reported 
earnings and future cash flows is a good measure of earnings informativeness. 

16 See Lev (1989) and Kothari (2001) for a summary of the extensive research examining the association between 
a firm's earnings and stock returns. 

u We choose scalars for each performance metric based on previous research and scale the independent variables 
with the same scalar as the dependent variable. Consequently, earnings and lead cash flows are both scaled by 
assets in our cash flow analysis, while unexpected earnings are scaled by shares outstanding. As we discuss 
below, we examine the sensitivity of our results to alternative scalars. 
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the coefficients for SAB 101 and unaffected firms on Earn and AEarn to test for a signif- 
icant difference between the two samples in earnings informativeness in the pre-adoption 
period. We also conduct an F-test on Post*Earn and Post*AEarn to test for significant 
differences in the changes in informativeness for the pre-adoption versus post-adoption 
periods across these two samples of firms. Like our earnings management tests, these tests 
use fiscal 1997—1999 quarterly data for the pre-adoption period and use the eight fiscal 
quarters in 2001 and 2002, and any fiscal quarters with data available in fiscal 2003 for 
our post-adoption period. 

We perform a second set of informativeness tests using the pro forma data collected 
from the SAB 101 firm's fiscal year 2000 10-K filing to measure as-if earnings for periods 
reported prior to the adjustment. More specifically, we use the pro forma reconciliation to 
separate earnings into two parts: (1) earnings that do not include accelerated revenue rec- 
ognition (Pro Forma Earn), and (2) earnings as a result of accelerated revenue (Accelerated 
Earn). Firms that chose to report pro forma earnings presented this information for either 
fiscal years 1998 and 1999, or fiscal year 1999 only. Since the pro forma data is only 
reported for annual earnings, we regress one-year-abead cash flows and annual returns on 
Pro Forma Earn and Accelerated Earn for each year provided in the reconciliation to 
investigate whether the earnings associated with the acceleration of revenue have a statis- 
tically different association with future annual cash flows or annual returns. Our annual 
return accumulation period begins at the end of the fourth month of fiscal year t and ends 
at the end of the fourth month of fiscal year t-- 1. We estimate the following regressions: 


Annual LeadCFO,,,,,, = a + B,Pro Forma Earn, + B;Accelerated Earn; + € (6) 
Annual Returns, = a + B,Pro Forma Earn, + B,Accelerated Earn, +£ (7) 
where: 


Annual LeadCFO,,,,,, = annual cash flow from operations (annual Compustat data item 
308) scaled by end-of-period assets (annual Compustat data item 
6) for firm i in year t1; 

Annual Returns, = annual cumulative market-adjusted returns from CRSP (using the 
value-weighted market return) for firm í from end of month four 
of fiscal year t until the end of month four of fiscal year t+1; 

Pro Forma Earn, = difference between actual earnings reported for firm i in period t 
and the amount of the pro forma adjustment associated with a 
cumulative effect from an accounting change related to SAB No. 
101, scaled by end-of-period assets; and 

Accelerated Earn, = amount of earnings related to periods after fiscal year 2000 that 
had been recognized in a prior period earnings (i.e., earnings that 
had been accelerated in periods prior to fiscal year 2000) scaled 
by end-of-period assets. 


VL RESULTS 
Table 1 provides descriptive data by industry for all industries with five or more SAB 
101 firms. Specifically, we report the average adjustment size scaled by market value of 
equity, the types of adjustments made in each industry and the total number of adjustments 
in the industry. 
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The average earnings adjustment in our sample is nearly 3 percent of the market value 
of equity (MVE), while the largest average adjustment, 5.2 percent of MVE, is in the 
Management Services industry. We also find that the types of adjustments are clustered by 
industry. Specifically, non-refundable payments were most commonly recognized in the 
Pharmaceutical and Chemical, Communications, and Computer and Business Services in- 
dustries. In the Machinery and Measuring Instruments industries, it was more common to 
recognize revenue prior to final customer acceptance. The largest number of adjustments, 
43, is in the Pharmaceuticals and Chemicals industry. We also find that our sample is heavily 
concentrated in the Computer and Business Services and Measuring Instruments industries. 


Earnings Management Results 
Earnings Benchmarks 


Table 2 presents the results of estimating models (1) and (2), our earnings benchmark 
regressions. Panel À reports the results for earnings levels. For SAB 101 firms, the statis- 
tically significant positive coefficient on Netbin indicates that, in the pre-adoption period, 
SAB 101 firms are more likely to have small positive earnings and less likely to have small 
negative earnings. The statistically significant negative coefficient on Post*Netbin indicates 
that the discontinuity in the earnings distribution declines from the pre- to the post-adoption 
period. 

The coefficient on Netbin for unaffected firms is also positive and statistically signifi- 
cant. This indicates that in the pre-adoption period, unaffected firms also report more small 
positive earnings and fewer small negative earnings. In contrast to the findings for SAB 
101 firms, we find no evidence that this likelihood declined in the post-adoption period for 
unaffected firms. 

In the third column, we report the results for our pooled sample regression testing for 
a statistical difference in the SAB 101 and unaffected firms' propensity to meet benchmarks. 
The results indicate that in the pre-adoption period SAB 101 firms were more likely to 
meet benchmarks than unaffected firms. Furthermore, the reduction in the propensity to 
meet benchmarks associated with the adoption of SAB No. 101 affects SAB 101 firms 
more than unaffected firms. Together, these results are consistent with the hypothesis that 
SAB 101 firms were managing earnings in the pre-adoption period by accelerating revenue 
recognition to report fewer small losses and more small profits. 

Panel B of Table 2 reports the results from a similar analysis using the distribution of 
earnings changes for firms with positive net income. Similar to earnings levels, in the pre- 
adoption period, SAB 101 firms report more small positive and less small negative earnings 
changes than in the post-adoption period. We also find evidence that unaffected firms were 
more likely to meet the positive earnings change benchmark in the pre-adoption period; 
however, we find no evidence that this likelihood changes in the post-period. The tests of 
differences between the SAB 101 firms and the unaffected firms are not statistically sig- 
nificant. Overall, this analysis provides limited evidence that SAB 101 firms accelerated 
revenue recognition to meet the positive earnings change benchmark.!* 


I8 To ensure that our results are not driven by our method of calculating the expected number of observations in 
a bin, we repeat the analysis using an alternative method. We use nonlinear least squares to construct a curve 
that fits the histogram of each of the earnings distributions. The estimated function is used to calculate the 
expected number of firms for each bin. When we use this expectation, the results of our tests (not tabulated 
here) are qualitatively similar. 
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Incentives to Manage Earnings 

Table 3 presents univariate tests for differences in means between SAB 101 and un- 
affected firms for the earnings management incentives proxies. Due to data availability, we 
compare firm characteristics of 187 SAB 101 firms and 158 unaffected firms. We find SAB 
101 firms have weaker corporate governance (as indicated by less outsider share ownership) 
than unaffected firms, and 73 percent of SAB 101 firms have bank debt compared to 61 
percent of unaffected firms. We do not note a significant difference in the issuance of debt 
or equity securities in the three vears prior to SAB No. 101 adoption, with the exception 


TABLE 2 
Coefficients and (t-statistics) from a Regression of the Extent of the Discontinuity in the 
Earnings Distribution for a Sample of SAB 101 Firms and a Sample of Unaffected Firms 
before and after the Effective Date of SAB No. 101 


Panel À: Earnings Levels 
Earn Diff, = a + B,Post + B,Netbing , + B,Post*NetBin,, , + € 


SAB 101 
Firms Unaffected Firms Difference in 
Predicted Coefficients Coefficients Coefficients 
Variable Sign (t-statistics) (t-statistics) (F-test) 
Intercept +/— - —0.02 0.02 —0.04 
(—0.02) (0.02) (—0.03) 
Post +/— 0.02 0.05 —0.03 
(0.02) (0.03) (—0.01) 
Netbin +/— 68.625 40.75 27.88 
(14.99)*** (10.53)*** (4.65)*** 
Post*Netbin +/— —40.625 —1.75 —38.89 
(—6.27)*** (—0.32) (—4.59)*** 
Number of Obs.* 139 139 
Adjusted R? 0.66 0.60 


Panel B: Earnings Changes 
Earnchng _ Diff,, , = « + B,Post + BNetbin, , + ByPost*NetBing , + € 


SAB 101 
Firms Unaffected Firms Difference in 
Predicted Coefficients Coefficients Coefficients 
Variable Sign (t-statistics) (t-statistics) (F-test) 
Intercept +/— —0.01 —0.01 —0.01 
(—0.00) (—0.00) (—0.00) 
Post +/— 0.01 0.01 0.01 
(0.01) (0.01) (0.01) 
Netbin +/— 29.125 19.75 9.375 
(3.15)*** (1.55)* (0.60) 
Post*Netbin +/— —35.875 | 17.30 —18.375 
(—2.74)** (—0.97) (—0.86) 
Number of Obs.* 115 115 
Adjusted R? 0.05 0.01 0.16 


(continued on next page) 
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TABLE 2 (continued) 


*** ** * Indicates significance at the 1 percent, 5 percent, and 10 percent levels, respectively, based on a one- or 
two-tailed test as 
* Represents the number of histogram bins measured in the pre-period plus the number of bins measured in the 
post-period for each distribution type (earnings, earnings changes) and for eazh sample (SAB 101 and 
unaffected). 
Variable Definitions: 
Earn_Diff,,,, = difference between the expected number of firms and the actual number of firms for 
l each bin b in each period t for the histogram of net income (quarterly Compustat data 
item 69) scaled by end-of-period assets (quarterly Compustat data item 44). The bin 
width is 0.75 percent. The expected number of firms for each bin b in each period t is 
estimated using the average of the number of observations in the adjacent bins to the 
left and to the right of the bin of interest; 

Earnchng DIFF, ,, = difference between the expected number of firms and the actual number of firms for 
each bin b for each period t for the histogram of the seasonally adjusted change in net 
income (quarterly Compustat data item 69) scaled by the end-of-period assets (quarterly 
Compustat data item 44). The bin width is 0.75 percent. The expected number of firms 
for each bin b in each period t is estimated using the average of the number of 
observations in the adjacent bins to the left and to the right of the bin of interest; 

Netbing , = indicator variable equal to 1 if the firm falls in the histogram bin just to the right of the 
benchmark, —1 for the histogram bin just to the left af the benchmark, and 0 otherwise; 
Post — indicator variable equal to 1 if the observation is drawn from the post-adoption period 
earnings distribution, and 0 otherwise; and 
Post*Netbing , = calculated by multiplying Post by Netbin. 





of IPO offerings. We find that only 9 percent of the SAB 101 firms issued IPOs, compared 
to 15 percent of the unaffected firms. We find no difference in the use of financial per- 
formance bonuses in the two samples. Also, there is no significant difference in asset size 
for our two samples, indicating that our size match was effective. 

Table 4 presents the results of estimating model (3), our logistic regression comparing 
the earnings management incentives of SAB 101 and unaffected firms. We find that SAB 
101 firms have weaker corporate governance than unaffected firms. This finding suggests 
that firms with good corporate governance adopt more conservative accounting policies. 
We also find that SAB 101 firms are more likely to have bank debt. This result is consistent 
with financial covenants affecting the revenue recognition decision. 

We find no differences in the propensity of SAB 101 and unaffected firms to issue debt 
or equity securities or to go public. We also do not find a difference between the two 
samples in the use of bonuses based on financial performance. This is consistent with the 
evidence in Graham et al. (2004) that financial executives consider bonuses to be a relatively 
less important earnings management incentive. 


Earnings Informativeness Results 
Cash Flow Analysis 

Table 5 presents the results of estimating model (4), a regression of future operating 
cash flows on current earnings. Panel A reports the association between quarter q earnings 
and one-quarter-ahead cash flows. Panels B, C, and D report the results of similar analyses 
using future cash flows accumulated for two quarters, three quarters, and four quarters, 
respectively. 

In Panel A of Table 5 we find a statistically significant positive coefficient on Earn for 
SAB 101 firms indicating a positive association between earnings and one-quarter-ahead 
cash flows in the pre-adoption period. A decline in the association between earnings and 
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TABLE 3 
Univariate Tests of Differences in Mean Values of Firm Characteristics for a Sample of SAB 
101 Firms and a Sample of Unaffected Firms Matched on Size and Industry Membership 


Predicted Mean for Mean for t-statistic 
Sign of SAB 101 Unaffected Difference of 
Variable — Difference — — Firms .Firms in Means Difference 
Corp Gov = 178.80 201.70 —22.90 —1.99** 
Bank Debt + 0.73 0.61 0.12 2.33** 
IPO +T = 0.09 0.15 —0.06 —1.80* 
Debt Issue +/— 0.14 0.18 —0.04 — 1.03 
Equity Issue +f== 0.38 0.31 0.07 1.38 
Bonus + 0.81 0.84 0.03 —0.70 
Size +/— 5.51 5.56 —0.05 —0.21 
Num Obs. 187 158 
*** ** * Indicates significance at the 1 percent, 5 percent, and 10 percent levels, respectively, based on a one- or 
two-tailed test as appropriate. 

Variable Definitions: 


Corp Gov — sample rank of percentage of board of director ownership shares that are held by outside 
directors; 


Bank Debt = 1 if the firm has outstanding bank debt, and 0 otherwise; 
IPO = 1 if the firm undertook an initial public offering during the three-year period prior to SAB No. 
101 adoption, and 0 otherwise; 
Equity Issue — ] if the firn issued equity during the three-year period prior to SAB No. 101 adoption, and 0 
otherwise; 


Debt — 1 if the firm's issued public debt during the three-year period prior to SAB No. 101 adoption, 
and 0 otherwise. 
Bonus = 1 if the firm offered a cash bonus based on financial performance in the fiscal year Prowl SAB 
No. 101 adoption, and 0 otherwise; and 
Size = the log of assets (annual Compustat data item 6) at the end of the fiscal year prior to SAB No. . 
101 adoption. 


future cash flows in the post-adoption period is indicated by the negative coefficient on 
Post* Earn. 'This is consistent with the FASB's concern that SAB No. 103 has led to a 
decline in earnings informativeness. 

The second column of Panel A of Table 5 indicates that there is also a positive cor- 
relation between earnings and future cash flows for unaffected firms in the pre-adoption 
period. In contrast to SAB 101 firms, unaffected firms show no change in the association 
between earnings and future cash flows in the post-adoption period. F-tests of differences 
between the SAB 101 and unaffected firms indicate that, in the pre-adoption period, SAB 
101 firms’ earnings were more informative than earnings of the unaffected firms. In the 
post-adoption period, SAB 101 firms had a statistically significant reduction in the asso- 
ciation of earnings and future cash flows when compared to the unaffected firms’ 
association. 

We find a similar pattern in Panels B and C for two-quarter-abead and three-quarter- 
ahead cumulative cash flows. The only difference in significance between these two panels 
and Panel A is the lack of significance in Panel C on the reduction in the association for 
SAB 101 firms in the post-adoption period. However, the results in Panels B and C do 
indicate that the incremental informativeness of earnings for SAB 101 firms, compared to 
unaffected firms, deteriorates as the period of accumulation of cash flows is lengthened. In 
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TABLE 4 
Coefficients and (Chi-Squared Statistics) from a Logistic Regression Comparing the Earnings 
Management Incentives of a Sample of SAB 101 Firms and a Sample of Unaffected Firms 
Matched on Size and Industry Membership 


SAB = a  B,Corp Gov + B,Bank Debt + B4IPO + B,Equity Issue + B,Debt Issue 
+ B Bonus + BSize + € 


Predicted Coefficient 
Variable Sign (Chi-Squared) 
Intercept +/— 0.53 
(1.46) 
Corp Gov = — 0.002 
(—4.10)** 
Bank Debt + 0.66 
(7.01)*** 
IPO +/— —0.51 
(—2.13) 
Equity Issue +/— 0.02 
(0.01) 
Debt Issue +/— —0.27 
(—0.58) 
Bonus + —0.27 
(—0.85) 
Size +/— —0.02 
(—0.06) 
Number of Obs. 345 
Pseudo R? 3.5% 
Percent Correctly 61.8% 
Predicted 
*** ** * Indicates significance at the 1 percent, 5 percent, and 10 percent levels, respectively, based on a one- or 
two-tailed test as appropriate. 
Variable Definitions: 


SAB = 1 for SAB 101 firms, 0 for unaffected firms; 
Corp Gov — sample rank of percentage of board of director ownership shares that are held by outside 
directors; 


Bank Debt = 1 if the firm has outstanding bank debt, and 0 otherwise; 
IPO = 1 if the firm undertook an initial public offering during the three-year period prior to SAB No. 
101 adoption, and 0 otherwise; 
Equity Issue = 1 if the firm issued equity during the three-year period prior ta SAB No. 101 adoption, and 0 
otherwise; 
Debt Issue = 1 if the firm's issued public debt during the three-year period prior to SAB No. 101 adoption, 
and 0 otherwise; 
Bonus = 1 if the firm offered a cash bonus based on financial performance in the fiscal year prior to SAB 


No. 101 adoption, and 0 otherwise; and 
Size = the log of assets (annual Compustat data item 6) at the end of the fiscal year prior to SAB No. 


101 adoption. 


Panel D, we find that in the pre-adoption period, SAB 101 firms' earnings are more infor- 
mative than unaffected firms, but SAB No. 101 adoption does not have a marginal impact 
on SAB 101 firms. 

Collectively, these results demonstrate that accelerating revenue recognition results in 
earnings that are more informative about future cash flows. Restrictions on the acceleration 
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TABLE 5 


391 


Coefficients (t-statistics) from a Firm and Quarterly Fixed-Effects Regression of the 
Association between Earnings and Cash Flows Accumulated One Quarter Ahead, 
Two Quarters Ahead, Three Quarters Ahead, and Four Quarters Ahead for a Sample of 
SAB 101 Firms and a Sample of Unaffected Firms 


LeadCFO, ,,,, = a + B Post + B,Earn, , + B,Post*Earnao +e 


Predicted 
Variable Sign 
Panel A: Cash Flows, ., 
Intercept Tp 
Post v 
Earn Ti 
Post*Earn t= 
Number of Obs. 
Adjusted R? 
Panel B: Cash Flows, ,, 
Intercept Tf 
Post tfe 
Earn +i 
Post*Earn += 
Number of Obs. 
Adjusted R2 
Panel C: Cash Flows,.s 
Intercept += 
Post +/— 
Earn ph 
Post*Earn qe 
Number of Obs. 
Adjusted R? 


0.45 
(17.35)*** 
—0.13 

(—3.58)*** 

4325 
0.10 


0.001 
(0.23) 
—0.002 
(—0.62) 
0.38 
(16.51)*** 
—0.07 
(—2.24)** 
4110 
0.10 


0.29 
(13.89)*** 
—0.04 

(—1.24) 

3900 

0.10 


0.23 
(10.87)*** 
0.03 
(0.94) 
3716 
0.06 


0.21 


Difference in 


Coefficients 


(F-test) 


—0.16 
(—3.31)*** 


—0.11 
(=2.02)*** 


0.001 
(0.73) 
0.002 
(0.53) 
0.12 
(4.77) *** 
—0.07 
(—1.64)* 


(continued on next page) 
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TABLE 5 (continued) 


SAB 101 Unaffected 
Firms Firms Difference in 
Predicted Coefficients Coefficients Coefficients 
Variable Sign (t-statistics) (t-statistics) (F-test) 
Panel D: Cash Flows,,, 
Intercept T 0.000 0.00 0.00 
(0.10) (0.51) (0.43) 
Post +/— —0.000 —0.002 0.002 
(—0.05) (—0.91) (0.55) 
Earn BI 0.27 0.13 0.14 
(13.21)*** (9.12)*** (5.79)*** 
Post*Earn *f— 0.07 0.02 0.03 
(2.05)** (0.99) (0.86) 
Number of Obs. 3708 3165 
Adjusted R? 0.09 0.04 
*** ** * Indicates significance at the 1 percent, 5 percent, and 10 percent levels, respectively, based on a one- or 
two-tailed test as appropriate. 
Variable Definitions: 


LeadCFO, ,., = accumulated cash flow from operations (quarterly Compustat data item 108) scaled by end-of- 
period assets (quarterly Compustat data item 44) for firm i in one, two, three, and four 


subsequent quarters; 
Post = indicator variable equal to 1 if the firm quarter observation occurs during the post-adoption 
period, and 0 otherwise; 
Earn, = earnings, measured as the net income (quarterly Compustat data item 69) for firm / in quarter 
q, scaled by end-of-period assets (quarterly Compustat data item 44); and 
Post*Earn, , = Post multiplied by Earn. 


of revenue recognition imposed by SAB No. 101 reduced SAB 101 firms' earnings inform- 
ativeness, but did not impact unaffected firms. These findings are consistent with the FASB's 
belief that unearned revenues are informative. 


Returns Analysis 

Table 6 provides the results of estimating model (5), our regression of earnings an- 
nouncement period returns on unexpected earnings. These ERC test findings corroborate 
the results from the cash flow analysis. For SAB 101 firms, we find a positive and statis- 
tically significant coefficient on AEarn, indicating that the market reacts to unexpected 
earnings information. We also find a negative and statistically significant coefficient on 
Post*AEarn, indicating that the informativeness of unexpected earnings declines during the 
post-adoption period. 

For unaffected firms, we also find a significant positive coefficient on AEarn, indicating 
that unexpected earnings are associated with returns for these firms. However, the coefficient 
on Post*AEarn is not statistically significant, indicating no discernable change in the as- 
sociation between unexpected earnings and returns in the post-period. 

The F-tests, reported in column 3, indicate that in the pre-adoption period SAB 101 
firms had more informative unexpected earnings than unaffected firms. In addition, we find 
that SAB No. 101 adoption is associated with a significantly larger reduction in unexpected 
earnings informativeness for SAB 101 firms than for the unaffected firms. 
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TABLE 6 
Coefficients (t-Statistics) from a Firm and Quarterly Fixed-Effects Regression of Earnings 
Announcement Period Returns on Unexpected Earnings for a Sample of Firms Affected by 
SAB No. 101 and a Sample of Unaffected Firms 


Returns, = a + B,Post + B,AEarng p + BPost*AEarn,,, + € 


SAB 101 Unaffected 
Firms Firms Difference in 
Predicted Coefficients Coefficients Coefficients 
Variable Sign (t-statistics) (t-statistics) (F-test) 
Intercept +/— — 0.002 — 0.000 —0.001 
(—0.78) (—0.12) (—0.44) 
Post +/— —0.000 — 0.000 0.002 
(—0.07) (—0.13) (0.07) 
AEarn +/— 0.02 0.01 0.011 
(5.61)*** (1.98)** (1.94)** 
Post*AEarn +/— —0.01 0.003 —0.014 
(—1.81)* (0.62) (—1.70)* 
Number of Obs. 3091 2812 
Adjusted R2 0.01 0.003 
*** ** * Indicates significance at the 1 percent, 5 percent, and 10 percent levels, respectively, based on a one- or 
two-tailed test as appropriate. 
Variable Definitions: 


Returns) = two-day cumulative market-adjusted returns from CRSP (using the value weighted market 

index) for firm i on the earnings report date t and date t— 1; 
Post = indicator variable equal to 1 if the firm quarter observation occurs during the post-adoption 

period, and 0 otherwise; 

AEarn, ,, = unexpected earnings, measured as the seasonally adjusted change in net income (quarterly 
Compustat data item 69) for firm i in period q, scaled by the end-of-period shares outstanding 
(quarterly Compustat data item 61); and 

Post*AEarn,,, = Post multiplied by AEarn. 


We conduct several different sensitivity analyses to ensure that our earnings informa- 
tiveness results are not driven by our research design choices.? In our returns analysis, we 
increase the return window to five days and to ten days, and the results remain qualitatively 
the same (although the effect is weaker). We also scale unexpected earnings by assets 
(quarterly Compustat data item 44), define earnings as earnings excluding extraordinary 
items (quarterly Compustat data item 76), and as income from operations (quarterly Com- 
pustat data item 21) and the results remain unchanged. 


Pro Forma Earnings 

In Table 7, we report the results of models (6) and (7), our regressions of future annual 
cash flows and annual returns on pro forma earnings and accelerated earnings (the difference 
between previously reported and pro forma earnings). The first column of results indicates 
that both pro forma earnings and accelerated earnings are associated with future cash flows. 


19 To ensure that our results are not driven by outliers, we winsorize all variables in the reported regressions at 
five standard deviations. Truncating at the same level produces similar results. Similar results are also obtained 
when we use three standard deviations to winsorize the data. The results without winsorizing are similar to 
those reported. 
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TABLE 7 
Coefficients and (t-Statistics) from Regressions of Annual One-Year-Ahead Cash from 
Operations and of Annual Returns on Pro Forma Earnings and Accelerated Earnings for 
SAB 101 Firms 


Annual LeadCFO,,,,, = a + B,Pro Forma Earn,, + B,Accelerated Earn, + € 
Annual Returns, = a + B,Pro Forma Earn, + B,Accelerated Earn, + € 


Predicted Annual Annual 
Variable Sign LeadCFO Returns 
Intercept +/— 0.004 0.36 
(0.24) (3.56) 
Accelerated +/— 0.41 1.63 
Earn (1.88)* (1.70)* 
Pro Forma Earn +/— 0.58 0.57 
(11.07)*** (1.67)* 
Number of Obs.* 184 151 
Adjusted R? 0.396 0.007 


*** ** * Indicates significance at the 1 percent, 5 percent, and 10 percent levels, respectively, based on a one- or 

two-tailed test as appropriate. 

* Of the 110 firms that provided pro forma data, approximately 80 percent provided pro forma earnings for all 
periods affected by the.accounting change, and 70 percent had one-year ahead returns data available for each 
pro forma period presented. Consequently, our pro forma tests use 184 observations for the cash flow analysis 
and 151 for the returns analysis. 

Variable Definitions: 

Annual LeadCFO,,,,, = annual cash flow from operations (annual Compustat data item 308) scaled by end-of- 

period assets (annual Compustat data item 6) for firm i in year t+1; 

Annual Returns, = annual cumulative market-adjusted returns from CRSP (using the value-weighted 
market return) for firm i from end of month four of fiscal year t until end-of-month 
four of fiscal year r+ 1; 

Pro Forma Earn, = difference between actual earnings reported for firm i in period t and the amount of the 
pro forma adjustment associated with a cumulative effect from an accounting change 
related to SAB No. 101, scaled by end-of-period assets; and 

Accelerated Earn,,, = amount of earnings related to periods after fiscal year 2000 that had been recognized in 
a prior period earnings (i.e., earnings that had been accelerated in periods prior to 
fiscal year 2000) scaled by end-of-period assets. 


Furthermore, there is no statistically significant difference between the coefficients on these 
two variables (not tabulated), suggesting that the accelerated revenue targeted by SAB No. 
101 and other revenue reported by these firms have similar associations with future cash 
flows. These results are consistent with the cash flow results reported in Table 5. 

In the second column of Table 7, we find a significant association between pro forma 
earnings and annual returns, and accelerated earnings and annual returns. However, we find 
no statistically significant difference between the coefficients on accelerated versus pro 
forma earnings.” | 

Overall, the results of the cash flow, ERC, and pro forma tests provide evidence that 
SAB No. 101 adoption is associated with a reduction in the informativeness of earnings. 
These results are consistent with the FASB's argument that revenues recognized prior to 


20 We use a long return window that begins on the first day of the fifth month of fiscal year t and ends at the end 
of the fourth month of fiscal year t-- 1 in order to incorporate the market’s response to earnings on the earnings 
announcement date, on the date of the press conference discussing earnings, or on the 10-K filing date. When 
we begin our accumulation period at the end of the third month of period t and finish at the end of the third 
month of period t+ 1, our results are quantitatively the same. 
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completion of the earnings process provide relevant information about future performance 
resulting in more informative earnings. 
Supplementary Analysis 

The results of our earnings management tests suggest that some of the SAB 101 firms 
chose to accelerate revenue recognition to manage earnings. Despite this finding, the results 
of our earnings informativeness tests suggest that, on average, accelerated revenues are 
value relevant. However, this average result is not necessarily applicable for firms that 
accelerate revenue recognition to manage earnings. That is, firms could be managing earn- 
ings either to **mask true performance” or to provide value-relevant information to investors 
and thus make earnings more informative. 

To provide evidence on these two alternatives, we conduct a supplemental analysis that 
uses the predicted probabilities from our logistic regression to partition our SAB 101 firms 
into those with higher versus those with lower incentives to accelerate revenue recognition 
to manage earnings. We interact this partitioning variable with the earnings variable in 
the cash flow and ERC models to try to determine whether the extent of firms' earnings 
management incentives affects the informativeness of their earnings. We employ the fol- 
lowing regressions for the pooled sample, controlling for fixed firm and quarter effects: 


LeadCFO,,..,; = a + B,Post + B,SAB + B,Post*SAB + B,High*SAB 
t B4Low*SAB + B,High* SAB* Post + B,Low*SAB*Post 
+ BsEarn, , + ByPost*Earn, ,. + Q,oNoincent* Earn 
+ B,,Noincent*Post*Earn + B,,High*Earn 
t B,,High*Post* Earn + B,,Low*Earn + Bj4Low*Post*Earn + € (8) 


Returns,» = a + B Post + B,SAB + B,Post*SAB + B,High*SAB + B,Low*SAB 
+ B ¿High*SAB*Post + B,Low*SAB*Post + B,AEarn, g 
+ ByPost*AEarn, p + Bj9Noincent* AEarn, o 
+ B,,Noincent* Post* AEarn, , + By,High* AEarn, p 
+ B,,High*Post* AEarn,, + Bi4Low*AEarn, g 
+ BisLow*Posr*AEarn,,, + € (9) 


where: 


LeadCFO,,.:,; = cash flow from operations (quarterly Compustat data item 108) scaled by 
| assets (quarterly Compustat data item 44) for firm i in quarter q+1; 
Returns, = two-day cumulative market adjusted returns from CRSP (using the value 
weighted index return as our measure of the market's return) for firm i 
on the earnings report date t and date :—1; 
Post — indicator variable equal to 1 if the firm quarter observation occurs during 


the post-adoption period, and 0 otherwise; 


21 Since there are firms in our sample that are missing data necessary for the logistic analysis, we code these firms 
as having unknown incentives to manage earnings and create a dummy variable (Noincent) to measure the 
informativeness of earnings for these firms. 
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SAB = indicator variable equal to 1 for SAB No. 101 firms, and 0 otherwise; 

High = indicator variable equal to 1 for SAB No. 101 firms with above the 
median-fitted value from the logistic regression of earnings management 
incentives, and 0 otherwise; 

Low = indicator variable equal to 1 for SAB No. 101 firms with below the 
median-fitted value from the logistic regression of earnings management 
incentives, and 0 otherwise; 

Noincent = indicator variable equal to 1 for SAB No. 101 firms missing data nec- 
essary to estimate logistic regression of earnings management incentives, 
and O otherwise; 

Earn = earnings, measured as the net income (quarterly Compustat data item 69) 
for firm i in quarter q, scaled by the end-of-period assets (quarterly Com- 
pustat data item 44); and 

AEarn,,, = Unexpected earnings, measured as the seasonally adjusted change in net 
income (quarterly Compustat data item 69) for firm i in quarter q, scaled 
by the end-of-period shares outstanding (quarterly Compustat data item 
61). 

* — [ndicates multiplication. 


The results of this supplemental analysis are reported in Table 8. Panel À provides 
associations between earnings and one-quarter-ahead cash flows. The statistically significant 
coefficients on the interaction of Earn with each earnings management indicator variable 
(High, Low, and Noincent) indicate a positive association between earnings and one-quarter- 
ahead cash flows in the pre-adoption period. This suggests that regardless of firms' earnings 
management incentives, there is a higher association between earnings and future cash flows 
when firms accelerate revenue recognition. Our results also indicate that, on average, in the 
post-adoption period SAB 101 firms with both high and low earnings management incen- 
tives experience a greater reduction in the ability of earnings to predict future cash flows 
than do unaffected firms. 

These results are consistent with Subramanyan's (1996) argument that some firms man- 
age earnings to provide value-relevant information to shareholders. They also suggest that 
the FASB's argument that accelerating the recognition of unearned revenue increases the 
value relevance of earnings holds for firms that have high incentives to manage earnings. 

Panel B of Table 8 provides the results of our supplemental analysis when we use 
returns as the metric for earnings informativeness. We find that SAB 101 firms with high 
earnings management incentives have more informative unexpected earnings than do un- 
affected firms. These high-incentive SAB 101 firms also experience a significant deterio- 
ration in the association between unexpected earnings and announcement period returns 
in the post-adoption period. Like the cash flow tests, these results are consistent with 
Subramanyan's (1996) signaling hypothesis. However, in contrast to the cash flow tests, we 
do not find similar results for the Jow-incentive firms. Thus, even though our supplemental 
cash-flow tests suggest that earnings of low-incentive firms are more informative than the 
earnings of unaffected firms, the market does not appear to have a stronger reaction. It is 
difficult to draw any conclusions from the mixed results on the low-incentive firms. 

Taken as a whole, the results from these supplemental analyses provide limited evidence 
that the earnings numbers reported by SAB 101 firms prior to SAB No. 101 adoption were 
more informative than earnings in the post-period, regardless of the earnings management 
incentives. 
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TABLE 8 
Coefficients and (t-statistics) from a Firm and Quarterly Fixed-Effects Regression of One- 
Quarter-Ahead Cash Flows on Earnings for a Sample of Unaffected Firms and a Sample of 
SAB 101 Firms Partitioned on Earnings Management Incentives 


Panel A: CFO Analysis 


LeadCFO,,,.:1, = @ + B,Post + B4SAB + B,Post*SAB + B,High*SAB + B,Low*SAB 
+ B High*SAB*Post + B,Low*SAB*Post + B,Earn,, + BsPost*Earna 
+ B,oNoincent* Earn + B,,Noincent*Post*Earn + B ,High*Earn 
+ Bi High*Post*Earn +B,,Low*Earn + B,;Low*Post*Earn + € 


Variable Predicted Sign Coefficient/(t-stat) 
Intercept +/— —0.010 
(—1.97) 
Post += — 0.009 
(— 1.43) 
SAB Te —0.004 
(—0.50) 
Post*SAB +/— 0.015 
(1.00) 
High*SAB +/— —0.012 
(—2.14)** 
Low* SAB +/- 0.012 
(1.46) 
High*SAB* Post Tie 0.036 
(3.61)*** 
Low*SAB* Post +/— —0.033 
(—2.34)*** 
Earn + 0.27 
(8.]0)*** 
Post*Earn +/— 0.019 
(0.34) 
Noincent*Earn +/— 0.249 
(2.52)*** 
Noincent*Post*Earn += 0.132 
(0.66) 
High* Earn +/— 0.299 
(5.11)*** 
High*Post*Earn +/— —0.236 . 
(—2.76)*** 
Low* Earn +/- 0.451 
(5.15)*** 
Low* Post*Earn +/— — 0.225 
(—1.75)** 


(continued on next page) 
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TABLE 8 (continued) 
Panel B: Returns Analysis 


Returns,, = a + B,Post + B,SAB + B,Post*SAB + B,High*SAB + B,Low*SAB 
+ B.High*SAB*Post + B,Low*SAB*Post + B.AEa mq, + B,Post*AEarn,,, 
+ B,oNoincent* AEarn, , + Bj, Noinceni* Post*AEarn, p + Bj,High*AEarn,, g 
+ Bj High*Post*AEarn, t Bi4Low*AEarny p + BisLow*Post*AEarny p + € 


Variable Predicted Sign Coefficlent/ (t-stat) 
Intercept +/— ` —0.001 
(—0.41) 
Post +/— —0.00 
(—0.14) 
SAB +/— 0.002 
(0.33) 
Post*SAB +/— | —0.009 
(—0.97) 
High*SAB +/— —0.002 
(— 0.68) 
Low*SAB +/— 0.003 
(0.58) 
High*SAB* Post +/— 0.003 
(0.70) 
Low*SAB* Post +/— —0.004 
(—0.57) 
AEarn + 0.009 
(1.95)** 
Post*AEarn te 0.003 
(0.63) 
Noincent* AEarn +/— 0.015 
| (1.70)** 
Noincent*Post*AEarn +/— —0.032 
(—1.79)** 
High*AEarn ques 0.01 
(2.17)** 
High* Post* AEarn +/— —0.021 
(—2.19)** 
Low*AEarn +/— —0.002 
(— 0.26) 
Low*Post* AEarn +/— 0.01 
(0.91) 


*** ** * Indicates significance at the 1 ume 5 percent and 10 percent levels, respectively, based on a one- 
or two-tailed test as appropriate 
Variable Definitions: 
Returns, = two-day cumulative market adjusted returns from CRSP (using the value-weighted index return 
as our measure of the market’s return) for firm i on the earnings report date z and date r— 1; 
LeadCFO,,i, = cash flow from operations (quarterly Compustat data item 108) scaled by assets (quarterly 
Compustat data item 44) for firm i in quarter q+1; 
Post = indicator variable equal to 1 if the firm quarter observation occurs during the post-adoption 
period, and 0 otherwise; 
SAB = indicator variable equal to 1 for SAB 101 firms, and 0 otherwise; 


(continued on next page) 
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TABLE 8 (continued) 


High = indicator variable equal to 1 for SAB 101 firms with above the median fitted value from the 
logistic regression of earnings management incentives, and 0 otherwise; 
Low = indicator variable equal to 1 for SAB 101 firms with below the median fitted value from the 
logistic regression of earnings management incentives, and 0 otherwise; 
Noincent — indicator variable equal to 1 for SAB 101 firms missing data necessary to estimate logistic 
regression of earnings management incentives, and 0 otherwise; 
Earn, = earnings, measured as the net income (quarterly Compustat data item 69) for firm i in quarter 
q, scaled by the end-of-period assets (quarterly Compustat data item 44); 
AEarn,,, = unexpected earnings, measured as the seasonally adjusted change in net income (quarterly 
Compustat data item 69) for firm i in quarter q, scaled by the end-of-period shares outstanding 
(quarterly Compustat data item 61); and 
= indicates multiplication. 


VIL CONCLUSIONS 

In this study we examine a sample of firms that were required to adjust their earnings 
because of SAB No. 101, an accounting bulletin issued to clarify the SEC's interpretation 
of revenue recognition rules. Specifically, we compare the pre-adoption earnings properties 
of a sample of SAB 101 companies to their post-adoption earnings properties, and we 
compare the SAB 101 firms to unaffected firms that are in the same industry, have similar 
asset sizes, and engage in the same types of revenue generating activities as the SAB 101 
firms. 

The SEC argues that improper revenue recognition is the single largest cause of finan- 
cial reporting problems and that firms use premature revenue recognition to meet Wall 
Street expectations. Based on the SEC's argument, we consider whether firms that accel- 
erated revenue recognition were more likely to meet earnings benchmarks and had greater 
earnings management incentives. 

Critics of SAB No. 101 argue that the bulletin resulted in changes in GAAP that made 
the financial reports of the firms affected by SAB No. 101 less informative. The FASB 
echoed this concern in justifying its ongoing revenue recognition project by arguing that 
revenues recorded prior to the completion of the earnings process contain value-relevant 
information about future performance. We examine this possibility by comparing the as- 
sociations between earnings and cash flows and unexpected earnings and announcement 
period returns for SAB 101 and unaffected firms in the periods prior and subsequent to 
SAB No. 101 adoption. 

We find evidence that firms recording SAB No. 101 adjustments were more likely to 
meet important earnings benchmarks in the pre-adoption period than in the post-adoption 
period, and that SAB 101 firms were more likely than unaffected firms to meet earnings 
benchmarks. However, we also find that the association between earnings and next period 
cash flows is higher for SAB 101 firms than for unaffected firms and that the association 
between earnings and future cash flows declines for SAB 101 firms in the post-adoption 
period. We find similar results in our analysis of the relationship between unexpected earn- 
ings and announcement period returns. 

We also find that, when we partition firms that accelerate revenue recognition into 
groups that have strong and weak incentives to manage earnings, the earnings of firms with 
the strongest incentive to manage earnings have more informative earnings than firms un- 
affected by the regulation. This result is consistent with the FASB's assertion that the 
relevance of reported earnings is, on average, greater when revenue recognition is not 
delayed until the completion of the earnings process, even when earnings management 
incentives are high. 
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The results of this study provide information of particular interest to practitioners, 
regulators, and standard setters who, when developing and implementing revenue recog- 
nition standards, must consider the trade-offs between discretion in reporting revenues prior 
to the completion of the earnings process, and the informativeness of unearned revenues. 


APPENDIX 
Panel A: License Agreement—Millennium Pharmaceutical 


Revenue recognition policy prior to SAB No. 101: 

“The Company recognized licensing revenue when underlying performance criteria for payment 
had been met and when the Company had an unconditional right to such payment. Depending on a 
license agreement's terms, recognition criteria may have been satisfied upon achievement of 
milestones, passage of time or product sales by licensee." 


Revenue recognition policy after SAB No. 101 adoption: 

“Under SAB No. 101, the Company recognizes revenue from licensing agreements using a 
contingency-adjusted performance model. Under this method, revenue related to up-front, time- 
based, and performance-based licensing payments are recognized over the entire contract 
performance period ... Payments received by the Company in excess of amounts earned are 
classified as deferred revenue." 


Financial Statement Effects of Adjustment (000s) 
1998 





1999 2000 
Earnings 10,338 (379,904) (201,927) 
Cumulative Effect (107,692) 
Earnings after Cumulative Effect (309,619) 
Pro forma Earnings (10,461) (417,147) 
Amount of earnings accelerated 20,799 37,243 


Panel B: Installment and Customer Ácceptance—Integrated Measurement Systems 


Change in Revenue Recognition Policy: 

“In the fourth quarter of 2000, we implemented SAB No. 101, effective January 1, 2000, by 
changing the timing of revenue recognition for system sales from the historical industry practice of 
when product is shipped (title passes) to when a customer has accepted a shipped product. This 
change in our revenue recognition policy has been accounted for as a change in accounting 
principle effective January 1, 2000. We have recorded an after-tax non-operating charge against net 
income in the amount of $7.3 million reflecting the deferral of revenue net of related expenses for 
shipments of our products previously reported as revenue prior to January 1, 2000 that had not 
been accepted by customers as of December 31, 1999. Billings in advance of revenue recognition 
are recorded as deferred revenues and the related costs are recorded as inventory at customer sites 
and as deferred commissions." 


Financial Statement Effects of Adjustment (000s) 
1998 


1998. . 1999 — .200 — 
Earnings (3,331) 5,851 10,317 
Cumulative Effect (7,317) 
Earnings after Cumulative Effect 3,000 
Pro forma Earnings (3,850) 3,033 
Amount of earnings accelerated 519 2,818 
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ABSTRACT: Holthausen and Verrecchia's (1990) and Kim and Verrecchia’s (1997) the- 
oretical models predict that private Information inferred at the time of an earnings an- 
nouncement (private event-period information) is associated with greater trading 
volume. We provide empirical evidence consistent with these theories. Specifically, an- 
nouncements that increase analysts’ private information (as measured by Barron et 
al.'s [1998] empirical proxies) are associated with increased trading volume, consistent 
with some investors similarly acquiring private event-period information. In additlon, 
anriouncements that decrease analysts' consensus are associated with more trading 
volume. Because consensus declines when private informatlon Increases, this finding 
provides reinforcing evidence that Investors trade following earnings announcements 
because of private information that becomes useful only in conjunction with the Infor- 
mation in the announcement and that this Information is important enough to spur 
trading. 
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L INTRODUCTION 
rading volume is the most observable indicator that financial accounting information 
affects investors (Cready and Hurtt 2002). This article reports empirical evidence 
concerning the descriptive validity of two related theoretical models of such trading 
volume: Kim and Verrecchia (1997) (hereafter KV) and Holthausen and Verrecchia (1990) 
(hereafter HV). These models predict that public announcements, e.g. earnings announce- 
ments, may lead investors to disagree about firm value and thet this disagreement is suffi- 
ciently material to stimulate trading. We provide empirical evidence consistent with KV's 
prediction that earnings announcements stimulate the development of private event-period 
information, which spurs trading. In addition, our evidence is consistent with the related 
HV prediction that consensus-decreasing earnings announcements lead to trading volume 

that is not evident around other earnings announcements. 

KV and HV are important advances in the theoretical literature because they move 
beyond early models that assumed public information simply substitutes for private infor- 
mation (e.g. Kim and Verrecchia 1991). For example, in KV's model, investors trade 
around public information releases as a result of three types of information: (1) private 
information gathered in anticipation of the announcement (pre-announcement information); 
(2) public information contained in the announcement itself (value-relevant information); 
and (3) private event-period information. This third type of information allows for a com- 
plementary relation between public and private information, in that public announcements 
spur investors to develop private event-period information that leads investors to act (i.e., 
trade). Prior research has shown that trading volume is associated with the first two types 
of information (Bamber et al. 1997; Beaver 1968), but to date there is no empirical evidence 
on whether the third type of information spurs trading.! 

KV define private event-period information as new information arising from an inter- 
action between information contained in the public announcement and private information 
gathered elsewhere that becomes useful only in conjunction with the announcement itself. 
Barron et al. (2002, 824—825) provide the following example of private event-period 
information: 


For example, consider two analysts. The first is a Certified Public Accountant who 
understands when and how specific accruals will reverse and affect future earnings. 
The second, a former political economist, correctly expects that back-orders from a 
Brazilian customer will soon decline and decrease earnings. Neither of these analysts 
can use his or her indirect knowledge in isolation. Yet both can combine their knowl- 
edge with reported earnings to produce “uniquely privately inferred information.” 


Intuitively, the information contained in the earnings announcement enables analysts 
and investors to capitalize on their private information to draw new inferences regarding 


! Prior empirical research has examined other theoretical predictions about how disagreement causes trading 
volume. For example, Ziebart (1990) finds that as dispersion in analysts' forecasts increases trading volume 
following earnings announcements increases. But dispersion in analysts' forecasts reflects both uncertainty and 
consensus, and both increases in uncertainty and decreases in consensus could lead to trading (Dontoh and 
Ronen 1993). Thus, it is unclear from Ziebart (1990) whether the consensus or the uncertainty component of 
the change in dispersion (or both) is associated with trading following earnings announcements. Atiase and 
Bamber (1994) use the level of forecast dispersion as a measure of pre-disclosure information asymmetry but 
they do not investigate asymmetry in event-period information. Similarly, Bamber et al. (1997) find that earnings 
announcements leading to differential forecast revisions also lead to higher trading. But as they point out, and 
our evidence confirms, differential forecast revisions are likely attributable to investors having pre-disclosure 
information of differential precision. Whether trade following earnings announcements is associated with private 
event-period information remains unresolved. 
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the value of the firm. Although Barron et al. (2002) provide empirical evidence that earnings 
announcements lead analysts to develop private event-period information in a manner con- 
sistent with KV's theory, there is no empirical evidence that private event-period informa- 
tion is sufficiently important to lead investors to trade. 

In a precursor to KV, HV derives a trading volume effect termed the "consensus" 
effect, which is similar to the private event-period information effect in KV. Investor con- 
sensus is the commonality of investors’ information as gauged by the correlation of their 
forecast errors. HV predicts that volume will increase as consensus decreases. HV is more 
rudimentary than KV in that HV is silent on the source of (or reason for) the decrease in 
consensus. However, both theories make a similar prediction if consensus decreases as 
private information increases.” In such a case, a decrease in consensus following an earnings 
announcement indirectly reflects the presence of private event-period information. 

Drawing on the Barron, Kim, Lim, and Stevens (1998) (hereafter BKLS) empirical 
proxies for analysts’ private information and consensus (described in more detail later in 
the article), this study provides empirical evidence supporting the KV and HV predictions 
that investors trade when earnings announcements trigger the development of private in- 
formation and decrease consensus. The BKLS proxy for private event-period information 
relies on more stringent assumptions than the BKLS consensus measure? Consequently, 
obtaining similar inferences with both measures provides reinforcing evidence that investors 
trade following earnings announcements based on private information that becomes useful 
only in conjunction with the information in the announcement. Additionally, if the HV and 
KV theories are, in fact, valid, then our evidence is consistent with analysts' information 
serving as a proxy for some sophisticated investors' information. 

An important consequence of the existence of trade based on private event-period in- 
formation is its potential to increase a firm's cost of capital. In related work using BKLS 
empirical measures, Botosan et al. (2004) find that the equity cost of capital increases with 
the precision of analysts' private information. Barron et al. (2002) find that earnings an- 
nouncements trigger the release of a large percentage of this private information. As Kim 
and Verrecchia (1994, 1997) contend, it is trade based on this type of private information 
that adversely affects firms' costs of capital. Thus, our evidence contributes to our under- 
standing of how earnings announcements affect both investors' trading decisions and, po- 
tentially, firms' cost of capital. 

In sum, we show that some investors trade around earnings announcements in a manner 
consistent with their possessing private event-period information. The results support the 
descriptive validity of the KV/HV theoretical predictions, including the somewhat coun- 
terintuitive notion that earnings announcements can lead to more private information, which, 
in turn, reduces consensus about firm value and is sufficiently material to spur investors to 
trade. In addition to supporting theoretical predictions that are somewhat contrary to con- 
ventional wisdom, the results also have important implications for firm managers and for 
accounting policymakers seeking to level the informational playing field. Specifically, if a 
significant amount of trade on private information occurs when public disclosure peaks, 
then some unsophisticated traders are likely at a greater disadvantage. Consistent with 
Botosan et al. (2004), this uneven informational playing field may increase firms' cost of 
capital. 


2 Empirical tests reported later in this article confirm that our proxies for consensus and private event-period 
information are significantly negatively correlated. Botosan et al. (2004) also document a significant negative 
relation between the precision of analysts' private information and consensus. 

3 See Barron et al. (2002) for evidence concerning the reasonableness of the BKLS assumptions. 
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The paper proceeds as follows: Section II describes the two theoretical models of 
trading and develops the hypotheses. Section III specifies the empirical models and then 
describes the variable measurement. Section IV describes the sample and discusses the 
results. Section V provides sensitivity analyses that serve to link this study to prior empirical 
research. Section VI concludes the paper. 


Il. THEORETICAL BASES AND HYPOTHESES 

Our study tests the predictions of two widely cited theories of how information affects 
trading volume: HV and KV. Because of differences in their assumptions, these two ana- 
lytical models are not directly comparable; however, both models make similar predictions 
about how private event-period information will affect trading volume around public an- 
nouncements (e.g., earnings announcements). 

In KV, public announcements stimulate three kinds of information-related trading. First, 
some trading arises because investors have private pre-disclosure information of differential 
precision. Investors with more precise private information place more reliance on that in- 
formation and less reliance on the earnings announcements. Thus, there is more (less) 
differential belief revision around earnings announcements with more (less) differential 
precision, and this differential belief revision stimulates more (less) trading (see also Kim 
and Verrecchia 1991). 

Second, when the precision (amount) of investors' private pre-disclosure information 
differs, earnings announcements that contain value-relevant information stimulate trading. 
The KV model demonstrates that this portion of announcement period trading volume is 
positively related to contemporaneous price changes. 

Third, some trading arises because of private event-period information investors de- 
velop in conjunction with the content of the earnings announcement itself. In other words, 
investors trade in the wake of earnings announcements not just because of the value- 
relevance of the information or differences in their prior beliefs, but also because their 
private event-period information leads them to interpret the reported amounts differently 
than others who lack this information. KV characterize this third type of trading volume 
as "residual trading," that is, information-induced trading not positively related to differ- 
ences in the precision of investors' private pre-disclosure information (the first category of 
information-related trading) or to contemporaneous price changes (the second category 
of information-related trading). This third source of trading is similar in spirit to the con- 
sensus effect in HV, in that consensus decreases when investors rely more on private in- 
formation and less on public information. 

In the HV model, investors’ interpretations of a public announcement contain both 
common and idiosyncratic error components so that heterogeneous interpretations of a 
public announcement lead to price and volume reactions. HV examine an effect of infor- 
mation on trading called the “consensus” effect. Consensus refers to the extent to which 
investors share common information. The more investors rely on common (public) infor- 
mation, the more their forecast errors exhibit covariance. Conversely, a decrease in the 
covariance of forecast errors indicates a decrease in reliance on public information. HV 
predicts that consensus-decreasing news events cause trading volume. 

One limitation of HV is that it is a single-period model. Investors observe a signal 
about the value of the risky asset and then trade on the basis of their inferences. HV assumes 
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that investors have homogeneous prior expectations and that the precision of their pre- 
disclosure information is homogeneous. Because HV assumes all investors have homoge- 
neous prior expectations (i.e., prior consensus is equal to 1), public announcements cannot 
increase consensus across investors.* 

In contrast, the subsequent KV (1997, 396) study introduces a two-period model they 
describe as “more descriptive of real market settings." The predictions in KV are consistent 
with HV, but because KV has a pre-disclosure period in which investors can disagree, 
consensus can increase or decrease as a result of an earnings announcement. Consistent 
with the KV assumptions, the descriptive evidence reported in Table 1 of this article indi- 
cates that consensus decreases following earnings announcements for approximately 60 
percent of our sample. This evidence is consistent with KV's assertion that their model is 
more descriptive of actual earnings announcements. 

The first hypothesis concerns the "residual trading" in KV, that is, trading triggered 
by the third category of information: private event-period information. The second hypoth- 
esis predicts a negative relation between trading volume and consensus. Evidence consistent 
with both hypotheses provides reinforcing empirical support for KV's theoretical prediction 
that part of the trading volume response to earnings announcements arises because the 
earnings announcement leads investors to develop private event-period information. The 
.two hypotheses, stated in alternate form, follow: 


H1: Earnings announcement “residual” trading volume that is unexplained by price 
changes or differential pre-disclosure private information is positively associated 
with private event-period information. 


H2: Earnings announcement "residual" trading volume that is unexplained by price 
changes or differential pre-disclosure private information is negatively associated 
with changes in consensus. 


HI. RESEARCH DESIGN AND VARIABLE MEASUREMENT 

This section describes the models and the empirical proxies we use to test the hypoth- 
eses. Drawing on the BKLS empirical proxies for private information and consensus we 
use analysts’ (observable) belief revisions to draw inferences about investors’ (unobserva- 
ble) private event-period information. Specifically, we examine how quarterly earnings an- 
nouncements affect properties of analysts’ forecasts of annual earnings. In particular, we 
examine how the BKLS proxies for: (1) the precision (amount) of private information in 
analysts’ forecasts and (2) analysts’ consensus about annual earnings change following each 
quarterly earnings announcement. To isolate residual trading volume we control for trading 
associated with price changes and differential belief revisions arising from differences in 
the precision of pre-disclosure private information. Finding that increases in the precision 
of analysts’ private information and decreases in analysts’ consensus are associated with 
residual trading volume would support the theoretical proposition that part of the trading 
volume around earnings announcements arises because investors trade based on private 
event-period information. 

We use the following empirical model to test H1: 


* Conversely, in models in which event-period information is entirely common, such as Kim and Verrecchia (1991), 
public announcements cannot decrease consensus. Such models are not descriptive of the U.S. market given 
evidence in Barron et al. (2002) that earnings announcments typically decrease the BKLS measure of consensus. 
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EXVOL, = a) + a,APPRECISION, + a,LARETURN, + a,LJUMBLING, + e (1) 
where: 


EXVOL, = event-period-induced excess trading volume, defined as the natural log 
of the cumulative trading volume over the three-day event window (days 
—1 to +1 around the interim earnings announcement) minus the natural 
log of the firm-specific median trading volume, where median volume 
is the median volume for contiguous three-day periods over the 249 
trading days prior to the earnings announcement event window, where 
j is the firm-quarter subscript; 
APPRECISION, = a continuous variable measuring changes in the BKLS proxy (defined 
below) for the precision of analysts' private information; 
LARETURN, = the natural log of the absolute value of the sum of firm's daily returns 
in the three-day event window; 

LJUMBLING, — the natural log of 1 minus the Pearson correlation between: (1) the 
annual earnings forecasts issued by individual analysts in the 45 days 
before the interim earnings announcement; and (2) the annual earnings 
forecasts issued by the same analysts within the 30 days after the interim 
earnings announcement; and 

e, — the error term for firm-quarter j. 


In HV, trading occurs only when consensus decreases. Thus, we test H2 with an in- 
dicator variable set to 1 when consensus declines as well as a continuous variable. We use 
the following models to test H2: 


EXVOL, = b, + b,CONNEG, + bLARETURN, + b,LJUMBLING, + e, (2) 
EXVOL, = Cy  c,ACONSENSUS, + cJLARETURN, + cJLJUMBLING, + e, (3) 
where all variables are as defined above, and: 


CONNEG, = an indicator variable set to 1 if the BKLS proxy (defined below) for 
analysts' consensus declined around the quarterly earnings announce- 
ment, and 0 otherwise; and 

ACONSENSUS, = a continuous variable measuring changes in the BKLS proxy (defined 
below) for analysts’ consensus around the quarterly earnings 
announcement. 


The KV model characterizes residual trading volume as that portion of announcement- 
period volume not caused by (1) value-relevant information reflected in contemporaneous 
price changes, (2) differences in the precision of investors’ pre-disclosure private infor- 
mation reflected in differential belief revisions, and (3) non-informational (e.g., liquidity) 
factors. Accordingly, our tests control for trading volume arising from these other factors. 
We control for first factor, value-relevant information, with a measure of the absolute return 
(LARETURN) around the earnings announcement. We control for the second factor, dif- 
ferential prior precision of investors’ information, with a measure of belief jumbling 
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(LJUMBLING). We control for the third factor, non-information-based trading, by subtract- 
ing firm-specific median trading volume from the dependent variable, EXVOL. This meth- 
odology is similar to the approach Atiase and Bamber (1994) and Kross et al. (1994) use 
to abstract from non-announcement, firm-specific levels of liquidity trading and ongoing 
levels of information-based trading unrelated to the theoretical notion of announcement- 
period residual volume. 

The explanatory variables of interest, APPRECISION, CONNEG, and ACONSENSUS, 
are based on the BKLS proxies detailed in the next section; this is followed by a discussion 
of the control variables. Due to extreme values in the empirical proxy PPRECISION, we 
estimate Equation (1) as a rank regression, with each variable being ranked within each 
quarter, while Equations (2) and (3) are estimated with ordinary least squares. 


Empirical Proxies for Private Event-Period Information and Consensus 

This section describes our empirical proxies for KV's theoretical construct, the preci- 
Sion, accuracy, or amount of private information, which we label PPRECISION, and HV's 
theoretical consensus construct or the commonality of investors' information, which we 
label CONSENSUS. 

KV define the amount of private event-period information as the precision of investors' 
private information signals where precision is the inverse of the variance of the signal 
around its realization (or signal accuracy). Note that as analysts possess more information, 
precision increases. HV define investor consensus as the commonality of investors’ infor- 
mation, gauged by the correlation of their forecast errors around an assumed expected error 
of zero (i.e., expectations are assumed to be unbiased). 

We empirically measure the extent of private event-period information and consensus 
with the following BKLS proxies: 


PPRECISION = 2 = (1 — CONSENSUS) - (5 (4) 


CONSENSUS — 1 — D/V (5) 
where: 


D = dispersion in analysts’ forecasts i.e., the sample variance of the individual forecasts 
(FC) around the mean forecast (FC), measured as > (FC, — FC)?/(n — 1), where n 
I1 


is the number of forecasts; and 
V = the mean of the squared differences between individual analysts' forecasts (FC; and 


reported earnings per share (EPS) measured as >) (FC, — EPS)/n. 
i=1 


ta 


Ross (1989) and Kandel and Pearson (1995) considered it unlikely that liquidity needs would be large enough 
to account for observed ongoing levels of security trading. Abstracting from the average non-announcement 
level of trading will likely abstract some ongoing information-based trading. Thus, our measure is a conservative 
estimate of information-based trading around earnings announcements. 

Our findings are qualitatively identical when we use the natural log of the cumulative trading volume over the 
three-day event window as the dependent variable and the natural log of the median preannouncement volume 
as an additional control variable. 


a 
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To understand the intuition behind these BKLS measures assume the number of fore- 
casts (n) is large and consider the extreme cases of D/V equal to 0 or 1. First, note that 
with a large number of forecasts, error due to common information is simply the difference 
between the mean forecast (FC) and realized earnings per share (EPS), i.e., idiosyncratic 
error is completely averaged out of the mean leaving only common error. Second, error 
due to private information is the difference between individual forecasts (F'C;) and the mean 
forecast (FC). D/V = 1 and CONSENSUS equals 0 when the mean forecast equals realized 
earnings. In this case, all forecast error is idiosyncratic and the BKLS model suggests that 
forecasts are based entirely on private information. In contrast, D/V = 0 and CONSENSUS 
equals 1 when all forecasts are exactly equal to the mean forecast. In this case, all forecast 
error is common and the BKLS model suggests that forecasts are based entirely on common 
information. In the unlikely case that all forecasts exactly equal realized earnings consensus 
is undefined because V is equal to O and D/V is infinite, consistent with perfectly accurate 
information. 

A close examination of Equations (4) and (5) reveals that the precision (or accuracy) 
of private information (PPRECISION) and consensus (CONSENSUS) are closely related 
but not identical constructs. Specifically, CONSENSUS measures the degree of common 
information relative to total information. CONSENSUS can decrease either due to an in- 
crease in the amount of private information or a decrease in the amount of common in- 
formation. In contrast, PPRECISION reflects the absolute precision (or accuracy) of infor- 
mation (1/V) multiplied by the relative degree to which this information is private in nature 
(D/V). PPRECISION can only increase because the accuracv of private information in- 
creased.’ Consistent with the HV and KV theoretical constructs representing related but 
different concepts, these two measures, PPRECISION and CONSENSUS, reflect private 
information in different ways.’ 

In Regression (1) APPRECISION 1s the difference between PPRECISION based on 
annual earnings forecasts issued by individual analysts in the 45 days before the interim 
earnings announcement and PPRECISION estimated with annual earnings forecasts issued 
by the same analysts within the 30 days after the interim earnings announcement. As 
mentioned above, because PPRECISION is unbounded, Regression (1) is estimated using 
rank regression with each variable ranked within each quarter. 

In Regression (2) CONNEG is an indicator variable set to 1 if consensus declined 
across an earnings announcement. ACONSENSUS in Regression (3) is the difference be- 
tween CONSENSUS in annual earnings forecasts issued by analysts in the 45 days before 
the interim earnings announcement and CONSENSUS in annual earnings forecasts issued 
by the same analysts in the 30 days after the interim earnings announcement.? 


7 Even in the unlikely event where individual forecast accuracy (1/V) is held constant and the relative degree to 
which information is private (D/V) increases, the mean forecast still moves closer to realized earnings. Thus, 
increases in private information always suggest some type of increase in forecast accuracy either in terms of 
more accurate indvidual forecasts or a more accurate mean forecast or both. 

* Note: Our empirical measure PPRECISION is equivalent to KV theoretical ccnstruct only under certain restrictive 
theoretical assumptions as described in BKLS. In contrast, consensus reflects the HV theoretical construct under 
a much less restrictive set of theoretical assumptions. 

? Barron et al. (2002) show that the decrease in consensus after earnings announcements is roughly the same 
using (1) a shorter 30-day pre-announcement window and a 10-day post-announcement window and (2) a longer 
45-day pre- and 30-day post-announcement windows. This similarity suggests that changes in BKLS consensus 
measured with the longer windows are primarily attributable to the effects af earnings announcements. Because 
we require a minimum of five analysts to revise their forecasts, using the wider window expands the sample. 
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We test the relation between change in consensus and residual volume in two ways for 
the following reasons. Consistent with the HV theory's focus on decreases in consensus, 
in Regression (2) we use an indicator variable set to 1 if consensus declined across an 
earnings announcement period, CONNEG. This approach tests whether earnings announce- 
ments that reduce consensus stimulate more trading volume than announcements that in- 
crease consensus. This dichotomous specification is relatively insensitive to influential ob- 
servations, measurement error, and misspecifications such as nonlinearity. However, because 
the indicator variable likely removes some information, we re-estimate Regression (3) with 
a continuous variable measuring the change in consensus (and as indicated in our descriptive 
statistics consensus increases for approximately 40 percent of our sample). 

Consistent with the argument that declines in consensus likely reflect increases in pri- 
vate information rather than decreases in common information, BKLS consensus is nega- 
tively related to BKLS precision of analysts' private information. Specifically, when all 
analysts have the same quality of private information, BKLS consensus can be written as 
the ratio of common to total (common plus private) information: 


h 
pel. (6) 
where h is the precision of all analysts’ common information and s is the precision of each 
analyst's private information. As is obvious from inspection of Equation (6), an increase 
in private event-period information, s, is not the only possible explanation for a decrease 
in consensus, p. A decrease in BKLS consensus could also occur because the announcement 
reduced the amount of common information available, h. However, it is not. very plausible 
that earnings announcements routinely reduce the total amount of common information, A. 
Indeed, Barron et al. (2002) show that, on average, earnings announcements increase the 
amount of common information. Moreover, consistent with our conjecture that decreases 
in consensus are more likely attributable to increases in s than to decreases in h, we find 
that the Spearman correlation between APPRECISION and ACONSENSUS for our sample 
is —0.514, which is significant at 0.0001. Also supporting this conjecture, Botosan et al. 
(2004) find a negative correlation between analysts' consensus and the precision of their 
private information during a later time period, 1993 to 2001. These findings support our 
interpretation of decreases in consensus as an indicator of increases in private information 
around earnings announcements. 


Control Variables 
LARETURN 

We abstract from trading attributable to value-relevant information in the earnings an- 
nouncement by including the natural log of the absolute value of firm returns during the 
three-day event window. This controls for trading volume associated with price changes 
resulting from the earnings announcement, one of the types of information-based trading 
in KV. 


LJUMBLING 

We include the measure of belief reordering, or jumbling, used by Barron (1995) and 
Bamber et al. (1997) to control for trading attributable to differences in the precision of 
investors' private pre-disclosure information, another type of information-based trading in 
KV. Both studies define "belief jumbling" as differential belief revisions that change an 
individual's expectation relative to the distribution of expectations held by others—in other 
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words, the reordering of beliefs (or forecasts) across investors (or analysts). We measure 
belief jumbling as the complement of the correlation between the relative positions of 
individuals' pre- and post-announcement forecasts. Bamber et al. (1997) contend that the 
volume associated with belief jumbling around earnings announcements likely reflects the 
extent to which announcements trigger disagreement about a security's value attributable 
to differences in the precision of individual investors' private pre-disclosure information. 

A limitation of this belief jumbling measure is that it captures more than just the 
differential precision of analysts' private pre-disclosure information. Specifically, as noted 
in Kandel and Pearson (1995), belief jumbling also captures belief revisions that violate 
the usual economic assumption that rational agents with identical information will have 
similar expectations. Fortunately, we seek to control for this form of disagreement as well 
because it likely leads to some trading unassociated with the development or use of private 
event-period information. 

Nonetheless, some jumbling may be caused by reductions in consensus that reflect the 
development of private event-period information. To the extent that controlling for jumbling 
abstracts from part of the effect of private event-period information, this weakens our tests 
and biases against finding the hypothesized results. However, we expect a minimal cor- 
relation, if any, between jumbling and our proxy for event-period private information, 
CONSENSUS, because consensus is the correlation in forecast errors at a point in time, 
while belief jumbling is the correlation over time between pre- and post-forecasts. Any 
given correlation in forecast errors at a point in time is consistent with many different 
relative positions of individual forecasts within the entire distribution of forecasts. As a 
result, significant changes in the relative positions of individual forecasts (or belief jum- 
bling) may occur without changing the overall correlation in forecast errors (1.e., forecasts 
may jumble without changing forecast dispersion [D] or the variance of forecasts errors 
[V]. Likewise, significant changes in the degree to which forecast errors are correlated at 
a point in time (consensus) need not be associated with changes over time in the relative 
positions of individual forecasts within the distribution of forecasts (jumbling). That is, 
forecasts may converge and exhibit a more common error without becoming reordered (or 
jumbled). Indeed, as shown later in Table 2, in our sample belief jumbling is not signifi- 
cantly correlated with either ACONSENSUS, CONNEG, or APPRECISION. Thus, control- 
ling for jumbling is unlikely to abstract from the effect of interest —trading volume induced 
by the development of private event-period information. 

In summary, we test H1 and H2 by testing for an association between: (1) excess trading 
volume and (2) either (a) changes in the BKLS proxy for the precision (amount) of analysts' 
private information or (b) changes in the BKLS proxy consensus. Our tests control for 
trading associated with price changes and the Bamber et al. (1997) proxy for differences 
in the precision of analysts' pre-disclosure private information. À positive association be- 
tween trading volume and our proxy for the precision of analysts' private information would 
support the KV prediction that some investors develop or acquire private event-period in- 
formation that affects their trading. A negative relation between trading volume and changes 
in consensus would support the HV model of trading volume. Finding both relations would 
strengthen the inference that investors develop or acquire private event-period information 
around earnings announcements, which is sufficiently important to influence their actions 
(i.e., their trading). 
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IV. THE SAMPLE AND EMPIRICAL EVIDENCE 
This section describes the sample and reports on the results of our tests. The sample 
consists of firm-quarter observations for quarters one through three between the first quarter 
of 1984 and the last quarter of 1996,19 inclusive, that meet the following criteria: 


(1) The quarterly (interim) earnings announcement date is available on the quarterly 
Compustat Primary, Supplementary, or Tertiary file; 

(2) Five or more individual analysts on I/B/E/S issued annual earnings forecasts 
within a 45-day pre-announcement window and then revised their forecast within 
a 30-day post-announcement window; and 

(3) Trading volume and stock return data are available on the CRSP NYSE, AMEX, 
and NASDAQ files. 


A total of 2,977 firm-quarter observations meet criteria (1) and (2), though 116 are 
missing volume or returns data. In supplemental analyses we compare our findings to 
evidence in earlier empirical research that used different proxies for investor disagreement. 
Because the calculation of these previously used variables requires deflating by the mean 
earnings-per-share forecast, we also deleted 137 observations with earnings-per-share or 
mean annual earnings forecasts of less than 0.10 in absolute value (to avoid small denom- 
inator problems in these tests). The final sample consists of 2,724 firm-quarter observations. 

The second sample selection criterion is the most restrictive, but it ensures that pre- 
and post-announcement measures of consensus are based on recent forecasts by the same 
analysts both before and after earnings announcements. As Bamber et al. (1997, 1999) 
point out, this requirement increases the precision of forecast-based proxies while helping 
to ensure that test results are uninfluenced by stale forecasts. This criterion also helps to 
ensure that we are primarily capturing changes in analyst consensus attributable to the 
quarterly earnings announcement. One limitation is that this criterion leads to a sample of 
relatively large firms with large quarterly earnings surprises. 


Descriptive Statistics 


Table 1 presents descriptive statistics for the sample. As evidenced by the large price 
reaction, the sample firms have relatively large quarterly earnings surprises. The mean 
(median) absolute return (ARETURN) for the three days surrounding the quarterly earnings 
announcement is 0.061 (0.046). In addition, the sample firms are quite large, the mean 
(median) market value of equity is $7,169 ($2,950) million, which is in the ninth decile of 
market value for all Compustat firms. Again, this result probably reflects the requirement 
of five forecast revisions. 

For descriptive purposes, we report summary statistics on both the percentage of out- 
standing shares traded over the three days surrounding the earnings announcement (VOL) 
and our measure of excess trading volume (EXVOL) that controls for non-information based 
trading. The mean (median) of VOL in this event window is 2.04 percent (1.15 percent), a 
result comparable to the 1.54 percent (1.02 percent) Bamber et al. (1997) reported for a 
sample subject to similar restrictions from an earlier (but overlapping) time period. The 
dependent variable, EXVOL, controls for non-information-based trading by subtracting the 
log of firm-specific median volume (defined as the median amount of volume for contiguous 
three-day periods over the 249 trading days prior to the earnings announcement event 
window) from the log of the event-period volume. EXVOL has a mean (median) of 0.384 
(0.345). Unlike prior measures of excess volume that are positively skewed, EXVOL is 


10 This is a representative time period that avoids the technology stock price run-up and crash. 
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TABLE 1 
Descriptive Statistics for a Sample of 2,724 Interim Earnings Announcements 
over the Period 1984 to 1996" 


25th 75th 

Variable Mean Percentile Median Percentile 
VOL (96 traded) 2.0496 0.66% 1.15% 2.14% 
EXVOL 0.384 0.005 0.345 0.737 
APPRECISION 278,647 0.011 16.5 647.6 
CONNEG 0.604 0 1 1 
ACONSENSUS —0.082 —0.234 —0.026 0.046 
Ranked APPRECISION 463.53 227.5 454.5 681.5 
Control Variables 

ARETURN 0.061 0.029 0.046 0.074 
LARETURN —3.081 —~ 3.526 —3.070 —2.600 
LJUMBLING —1.018 —1.563 —0.845 —0.257 
SIZE ($ millions) $7,169 $1,345 $2,950 $7,253 


"Each interim announcement has at least five analysts’ forecasts of annual earnings updated within 45 days 
before the quarterly (interim) announcement, with the same five analysts updating within 30 days after the 
announcement. 

Variable Definitions: 

VOL = the percent of outstanding shares traded over the three-day event window (days —] to +1 
relative to the quarterly earnings announcement date); 

EXVOL = event-period induced excess trading volume, defined as the natural log of the cumulative 
trading volume over the three-day event window (days —1 to +1 around the interim earnings 
announcement) minus the natural log of the firm-specific median trading volume, where 
median volume is defined as the median amount of volume for contiguous three-day periods 
over the 249 trading days prior to the earnings announcement event window; 

APPRECISION = a continuous variable measuring changes in the BKLS proxy for the precision (or amount) of 

analysts private information; 
CONNEG = an indicator variable set to 1 if the BKLS proxy for analysts’ consensus declined around the 
earnings announcement, and 0 otherwise; 

ACONSENSUS = a continuous variable measuring changes in the BKLS proxy for analysts’ consensus around 

the earnings announcement; 
ARETURN - the absolute value of the sum of a firm's daily returns in the three-day event window; 
LARETURN = the natural log of the absolute value of the sum of firm's daily returns in the three-day event 
window; 

LJUMBLING = the natural log of 1 minus the Pearson correlation between the annual earnings forecasts 
issued by individual analysts in the 45 days before the interim earnings announcement and 
the annual earnings forecasts issued by the same analysts within the 30 days after the interim 
earnings announcement; and 

SIZE == market value of equity, measured six days prior to the three-day event window. 


approximately normally distributed, because it is the difference between two logged mea- 
sures of trading volume. 

We use Equation (4) to calculate the proxy for the precision of analysts' private infor- 
mation. As discussed previously, PPRECISION is not naturally scaled and has extreme 
values. Accordingly, to test H1 we estimate Equation (1) using rank regression. Consistent 
with scale problems attributable to differences in the magnitude of error across firms or the 
fact that PPRECISION is unbounded, the mean change in PPRECISION is 278,647 and 
the median is 0.165. The ranked values of change in PPRECISION are much better behaved, 
with a mean (median) of 463.5 (454.5). À rank regression is used in Regression (1) with 
each variable ranked within each quarter, due to slightly different numbers of observations 
in each quarter (868 in quarter 1, 775 in quarter 2, and 1081 in quarter 3) the mean and 
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median ranked values of PPRECISION are not equal. À weakness of this proxy is that it 
may be subject to measurement error caused by scale problems not fully accounted for by 
rank regression. 

We use Equation (5) to calculate the proxy for analysts’ consensus. Consensus declines 
for 60.4 percent of the sample firms, the mean of CONNEG. The mean (median) change 
in analysts’ consensus, ACONSENSUS, is —0.082 (—0.026). Even with this broader sample 
of earnings announcements, the result is similar to the declines in consensus Barron et al. 
(2002) report. In contrast to PPRECISION, the consensus proxies are naturally scaled (be- 
tween —1 and 1) because CONSENSUS reflects the amount of private information relative 
to other information. 

The control variables are LARETURN and LJUMBLING. The mean (median) of 
LARETURN is —3.08 (—3.07). The mean (median) of our unlogged jumbling variable is 
0.538 (0.429), which is comparable to the 0.477 (0.589) that Bamber et al. (1997) reported. 


Pairwise Correlations 


Table 2 presents Spearman (above diagonal) and Pearson (below diagonal) correlations 
among all variables included in our analyses. Because they are similar, we discuss the 
Pearson correlations. The correlation between excess trading volume and the ranked change 
in the precision of analysts' private information is 0.030, which is insignificant (p-value of 
0.1203). However, quarterly earnings announcements that decrease consensus about upcom- 
ing annual earnings (CONNEG = 1) spur more trading volume than do announcements 
that increase consensus (p-value of 0.0001) and change in consensus correlates negatively 
with excess volume with a correlation coefficient of —0.064 (p-value of 0.0008). Untabu- 
lated analyses reveal the median announcement period increase in volume to be 38 percent 
greater for firms with a decline in consensus than it is for firms with an increase in con- 
sensus. These results provide preliminary evidence consistent with the hypothesis that those 


TABLE 2 
Spearman (above diagonal) and Pearson (below diagonal) Correlations among Excess Trading 
Volume, Ranked Change in Private Event-Period Information, Declines and Changes in 
Analysts’ Consensus, and Control Variables* 
(n = 2,724; p-values in parentheses) 


Ranked 
EXVOL APPRECISION CONNEG ACONSENSUS LARETURN LJUMBLING 


Variable —  EXVOL APPRECISION CONNEG ACONSENSUS LARETURN LJUMBLING 
EXVOL 0.031 0.084 —0.065 0.445 0.090 
(0.1008) (0.0001) (0.0007) (0.0001) (0.0001) 
Ranked 0.030 0.390 —0.469 0.039 0.019 
APPRECISION (0.1203) (0.0001) (0.0001) (0.0400) (0.3251) 
CONNEG 0.092 0.379 —0.847 0.080 0.013 
(0.0001) (0.0001) (0.0001) (0.0001) (0.5078) 
ACONSENSUS  —0.064 —0.504 —0.664 —0.071 —0.018 
(0.0008) (0.0001) (0.0001) (0.0002) (0.3351) 
LARETURN 0.429 0.037 0.079 —0.058 0.100 
(0.0001) (0.0516) (0.0001) (0.0025) (0.0001) 
LJUMBLING 0.090 0.028 0.018 —0.021 0.081 
(0.0001) (0.1508) (0.3506) (0.2788) (0.0001) 


* Table 1 describes the sample and variables. 
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quarterly earnings announcements that reduce the consensus about upcoming annual earn- 
Ings lead to more announcement-induced trading volume. In addition, consistent with in- 
creases In private information reducing consensus, the correlation between CONNEG and 
APPRECISION, is significantly positive (p-value of 0.0001). Similarly, the correlation be- 
tween ACONSENSUS and APPRECISION is significantly negative, —0.469 (p-value of 
0.0001). 

The control variable for information-induced volume related to price changes is 
LARETURN. Consistent with this variable acting as a control, the correlation between excess 
volume and LARETURN is 0.429 (p-value of 0.0001). Similarly, the correlation between 
LJUMBLING and EXVOL is 0.090 (p-value of 0.0001). 

We assume that the jumbling proxy primarily reflects the extent to which earnings 
announcements trigger new-found disagreement about the value of a security attributable 
to differences in the precision of individual investors' private pre-disclosure information. 
Although, jumbling may also reflect a reduction in consensus attributable to private event- 
period information, insignificant correlations between jumbling and both measures of 
changes in consensus (p-values of 0.2788 and 0.3506, respectively) suggest that in our 
sample jumbling does not reflect changes in consensus. In addition, in models such as KV, 
belief jumbling increases with both differences in precision and the magnitude of price 
changes. Consistent with this, in our sample the correlation between LJUMBLING and 
LARETURN is significantly positive (p-values of 0.001). These results are consistent with 
our interpretation of the jumbling variable as a control for differential precision of private 
pre-disclosure information. 


Results of Hypothesis Tests 


To address the dependency that results from including in the sample multiple obser- 
vations from the same firm, we calculate the t-statistics reported in Tables 3 and 4 using 
the Froot (1989) procedure. Table 3 reports the results of testing H1 using rank regression 
analysis to estimate Equation (1). Our proxy for KV private event-period information is the 


TABLE 3 
Rank Regression of Excess Trading Volume around Quarterly Earnings Announcements on 
Changes in the Precision of Analysts’ Private Information and Control Variables 


(n = 2,724) 

Expected Coefficient 

Variable* | Sign (p-value) 
Intercept 193.7 
(0.0001) 
APPRECISION + 0.047 
(0.0054) 
LARETURN -+ 0.069 
(0.0001) 
LJUMBLING + 0.466 
(0.0001) 
Adjusted R2 23.7% 


* The notes to Table 1 describe the sample and variables. 
b p-values are one-sided where tests predictions are made and are calculated using the Froot (1989) method. 
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precision (amount) of analysts' private information. Consistent with H1, the coefficient on 
APPRECISION is significantly positive, 0.047 (p-value of 0.0054). This finding suggests 
that earnings announcements that increase analysts’ private information also induce inves- 
tors to trade in a manner consistent with an increase in their private information. As 
expected, the coefficients on the control variables for price changes (LARETURN) and 
differential precision of pre-disclosure information (LJUMBLING) are both positive and 
significant, consistent with prior research (Bamber et al. 1997). 

Table 4 reports the results of testing H2 by estimating Equations (2) and (3). The 
positive CONNEG coefficient (p-value of 0.0003) is consistent with the hypothesis that 
earnings announcements that decrease consensus are associated with more excess trading 
volume than earnings announcements that increase consensus. In addition, in the esti- 
mation of Equation (3), the coefficient on the continuous change-in-consensus variable 
ACONSENSUS is negative (p-value of 0.0137). These results suggest that the more a quar- 
terly earnings announcement decreases consensus about the upcoming annual earnings, the 
greater the residual trading volume around the earnings announcement. Again, the signs 
and significance levels of the coefficients on the control variables are as expected. Because 
decreases in consensus are consistent with some investors having private event-period in- 
formation, these results suggest that this new private information causes investors to value 
securities differently and to trade. 


TABLE 4 


Regression of Excess Trading Volume around Quarterly Earnings Announcements 
on Changes and Declines in Analysts’ Consensus, and Control Variables 


(n = 2,724) 
Expected Coefficient 
Variable* Sign (p-value)? 
Intercept 1.330 1.372 
(0.0001) (0.0001) 
CONNEG + 0.069 
(0.0003) 
ACON u —0.067 
(0.0137) 
Controls: 
LARETURN + 0.310 0.312 
(0.0001) (0.0001) 
LJUMBLING + 0.031 0.031 
(0.0003) (0.0003) 
Adjusted R2 18.76% 18.94 


* The notes to Table 1 describe the sample and variables. 
P p-values are one-sided where tests predictions are made and are calculated using the Froot (1989) method. 


H Supplementary analysis revealed that BKLS measures of changes in the precision of public information are not 
significantly associated with residual trading volume following earnings announcements. Thus, our findings are 
not simply due to the fact that analysts' and investors' total amount of information increases around earnings 
announcements. 

12 We examined the effects of extreme values on our results by winsorizing our dependent variable and by deleting 
influential observations (two passes). We also estimated models (2) and (3) using rank regression similar to 
model (1). All inferences remain unchanged. 
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V. SUPPLEMENTAL ANALYSES 

Prior empirical research has found that other variables are associated with excess trad- 
ing volume around earnings announcements. Consequently, we investigate whether changes 
in the proxies for the amount of private information and consensus continue to be important 
indicators of earnings-announcement-induced trading volume after controlling for these 
other variables. 

First, as a proxy for market-wide trading volume, we control for the natural log of the 
percentage of shares traded over the three-day event window for all firms on the CRSP 
NYSE, AMEX, and NASDAQ files. Second, we include changes in the dispersion of an- 
alysts' forecasts, measured as the change in the standard deviation of annual earnings fore- 
casts issued 45 days before and 30 days after the interim earnings announcement, scaled 
by the absolute value of the mean annual earnings forecast.'? Untabulated analyses reveal 
that the coefficient on APPRECISION remains significantly positive (p-value of 0.0307). 
Similarly, the coefficient on CONNEG remains significantly positive and the coefficient on 
ACONSENSUS significantly negative (p-values of 0.001 and 0.052, respectively). Thus, 
including these additional control variables does not affect our inferences. Consistent with 
prior research, the coefficients on market-wide volume and change in dispersion are positive 
with p-values less than 0.051 (Bamber et al. 1997; Ziebart 1990).'^ Even though the same 
forecasts are used to construct ACONSENSUS, APPRECISION, belief jumbling, and change 
in forecast dispersion, both ACONSENSUS and APPRECISION retain incremental explan- 
atory power, consistent with the HV and KV theoretical models. 

We also added to the regression models two variables that Bamber (1987) showed to 
be related to trading volume around earnings announcements. First, as a direct measure of 
earnings surprise, we control for the absolute value of the difference between the mean 
analysts' forecast and actual earnings per share (before extraordinary items), deflated by 
the absolute value of the mean forecast. Second, we also control for firm size, measured 
as the market value of equity at the end of day —6 relative to the earnings announcement. 
Following prior research, we log-transformed both of these variables. When these variables 
are added to the regression model, the coefficient on APPRECISION remains significantly 
positive (p-value of 0.0029). The coefficients on earnings surprise and firm size also remain 
significantly positive with p-values of 0.0021 and 0.0003, respectively. Similarly, the finding 
that decreases in consensus stimulate trading remained unchanged with p-values of less 
than 0.008 for both coefficients. Both earnings surprise and firm size are, however, insig- 
nificant when added to the regression models with ACONSENSUS.'^ 

To test whether our findings are robust across differing levels of unexpected earnings, 
we also repeated our analysis on subsamples based on quartiles of the magnitude of un- 
expected earnings.'® The results for APPRECISION are mixed—the coefficient is signifi- 
cantly positive only in quartiles two and three. The coefficient on the dichotomous variable, 


55 BKLS decompose dispersion into both consensus and overall uncertainty. Including the change in their uncer- 
tainty measure in our analyses has no effect on our inferences. 

^ Our supplementary findings concerning changes in forecast dispersion are consistent with Barron and Karpoff's 
(2004) theoretical prediction that, after controlling for consensus effects, transaction costs prevent more trades 
when beliefs become less dispersed than when beliefs become more dispersed. 

55 In the model with APPRECISION, the coefficients on earnings surprise and size are significantly positive, 
consistent with prior research. However, in the models with consensus the coefficients on earnings surprise and 
size are insignificant. Further analysis shows the earnings surprise and firm size effects are subsumed when we 
include absolute returns in the model. 

16 The results are insensitive to the inclusion of indicator variables that control for whether the earnings announce- 
ment pertains to the first, second, or third fiscal quarter. 
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CONNEG, is significantly positive across all four quartiles but the coefficient on the con- 
tinuous measure, ACONSENSUS, although consistently of the hypothesized sign, is signif- 
icantly negative only in quartiles two and four. Measurement error or some unidentified 
nonlinearity may vary across the quartiles. Alternatively, there may be insufficient intra- 
quartile variation in the magnitude of earnings surprise in our sample because our firms all 
evidence relatively large earnings surprises. 

Conference calls to selected analysts could result in private event-period information 
of a type that is not the focus of this study (that is private information that is not the result 
of "interactive" knowledge obtained prior to the announcement that can only be capitalized 
upon after seeing the announcement). Tasker (1997, 8) reports that conference calls to 
analysts around earnings announcements were very uncommon before the mid-1990s. Con- 
sequently, we repeated our tests on the subsample of 1,226 earnings announcements oc- 
curring prior to 1991. The results are qualitatively similar to those for the larger sample, 
which provides evidence that the primary findings are not the result of conference calls.!’ 

In summary, proxies for the theoretically motivated variables—precision of analysts’ 
private information and consensus—remain significant even after controlling for other, less 
theoretical, determinants of trading volume, such as firm size. 


VL SUMMARY AND CONCLUSIONS 

Kim and Verrecchia's (1997) theoretical model predicts that private information inferred 
at the time of an information release (here, an earnings announcement) is associated with 
greater trading volume. We provide empirical evidence that announcements that lead to 
increases in analysts’ private information (as measured! by Barron et al.’s [1998] empirical 
proxies) also lead to increases in trading volume suggesting that some investors, as well as 
analysts, develop or acquire private event-period information. In addition, consistent with 
Holthausen and. Verrecchia's (1990) theoretical model, decreases in consensus lead to in- 
creases in trading volume. Because consensus declines when private information increases, 
this finding is consistent with theory and provides reinforcing evidence that earnings an- 
nouncements lead some investors to trade based on private event-period information. 

Our analyses also reveal that our theoretically derived measures of analysts' private 
information and consensus remain significant after controlling for other determinants of 
trading volume documented in prior research (e.g., change in dispersion of analysts' fore- 
casts, belief jumbling, earnings surprise, and firm size). Overall, our results support the 
theoretical proposition that part of the trading around earnings announcements arises be- 
cause investors use new private information developed at the time of the announcement in 
addition to prior and contemporaneous value-relevant information. 

Although explaining more of the trading volume around earnings announcements is 
important simply because it is the most detectable indicator that financial accounting influ- 
ences individual investors’ decision making and actions (Cready and Hurtt 2002), one can 
best appreciate the importance of our findings by viewing them together with the findings 
of other accounting research concerning firms' costs of capital. Botosan et al. (2004) find 
analysts' private information is positively associated with a firm's cost of capital, and Barron 
et al. (2002) find that earnings announcements trigger the release of a large percentage of 


HW Earnings management to meet or beat analysts' forecasts adds measurement error to the BKLS proxies for the 
precision of private information and consensus. Fortunately, however, earnings management to meet or beat 
analysts' forecasts was also less common prior to 1992 (Richardson et al. 2004). This information is important 
because this type of earnings management poses a potential threat to inferences derived using the BKLS method. 
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analysts' private information. Kim and Verrecchia (1994, 1997) contend that it is trade 
based on the type of private information reported by Barron et al. (2002) that adversely 
affects firms' costs of capital. Thus, when considered together with the findings of others, 
our results are consistent with the argument that private information triggered by earnings 
announcements is a factor in determining firms’ costs of capital. 

One caveat centers on the fact that the sample selection criteria led to a sample of large 
firms (the median firm is in the ninth decile of the Compustat population) with relatively 
large quarterly earnings surprises. The extent to which our inferences generalize to other 
populations of smaller firms or less surprising earnings announcements remains an open 
question. 


REFERENCES 

Atiase, R., and L. Bamber. 1994. Trading volume reactions to annual accounting earnings announce- 
ments: The incremental role of pre-disclosure information asymmetry. Journal of Accounting 
and Economics 17 (May): 309—329. 

Bamber, L. 1987. Unexpected earnings, firm size, and trading volume around quarterly earnings 
announcements. The Accounting Review (July): 510—532. 

, O. Barron, and T. Stober. 1997, Trading volume around earnings announcements and different 
aspects of disagreement. The Accounting Review (October): 575—597. 

— , and . 1999. Differential interpretations and trading volume. Journal of Fi- 
nancial and Quantitative Analysis 34 (3): 369—386. 

Barron, O. 1995. Trading volume and belief revisions that differ among individual analysts. The 
Accounting Review (October): 581—597. 

, O. Kim, S. Lim, and D. Stevens. 1998. Using analysts' forecasts to measure properties of 

analysts’ information environment. The Accounting Review (October): 421-33. 

, D. Byard, and O. Kim. 2002. Changes in analysts' information around earnings announce- 

ments. The Accounting Review (October): 821—46. 

, and J. Karpoff. 2004. Information precision, transaction costs, and trading volume. Journal 
of Banking and Finance (June): 1207-1223. 

Beaver, W. 1968. The information content of annual earnings announcements. Journal of Accounting 
Research 6 (Supplement): 67—92. 

Botosan, C., M. Plumlee, and Y. Xie. 2004. The role of private information precision in determining 
cost of equity capital. Review of Accounting Studies (June-September): 233—259. 

Cready, W., and D. Hurtt. 2002. Assessing market response using return and volume metrics. The 
Accounting Review (October): 891—909. 

Dontoh, A., and J. Ronen. 1993. Information content of accounting announcements. The Accounting 
Review (October): 857-869. 

Froot, K. 1989. Consistent covariance matrix estimation with cross-sectional dependence and heter- 
oskedasticity in financial data. Journal of Financial and Quantitative Analysis (September): 
333—355. 

Holthausen, R., and R. Verrecchia. 1990. The effect of informedness and consensus on price and 
volume behavior. The Accounting Review (January): 191—208. 

Kandel, E., and N. Pearson. 1995, Differential interpretation of public signals and trade in speculative 
markets. Journal of Political Economy 103 (4): 831—872. 

Kim, O., and R. E. Verrecchia. 1991. Trading volume and price reactions to public announcements. 
Journal of Accounting Research 29 (Autumn): 302-321. 

, and . 1994. Market liquidity and volume around earnings announcements. Journal of 

Accounting and Economics (January): 41—67. 

, and . 1997. Pre-announcement and event-period private information. Journal of Ac- 

counting and Economics (December): 395—419. 
































The Accounting Review, April 2005 


Evidence That Investors Trade on Private Event-Period Information 421 


Kross, W., G. Ha, and F. Heflin. 1994. A test of risk clientele effects via an examination of trading 
volume response to earnings announcements. Journal of Accounting and Economics (July): 67— 
87. 

Richardson, S., S. Teoh, and P. Wysocki. 2004. The walkdown to beatable analyst forecasts: The role 
of equity issuance and insider trading incentives. Contemporary Accounting Research 21 (4): 
885—924. 

Ross, S. 1989. Discussion: Intertemporal asset pricing. In Theory of Valuation, edited by S. Bhatta- 
charya, and G. Constantinides, 85—96. Totowa, NJ: Rowman & Littlefield. 

‘Tasker, S. 1997. Voluntary disclosure as a response to low accounting quality: Evidence from quarterly 
conference calls. Ph.D. dissertation, Massachusetts Institute of Technology, Cambridge, MA. 

Ziebart, D. 1990. The association between consensus of beliefs and trading activity surrounding earn- 
ings announcements. The Accounting Review (April): 477-488. 


The Accounting Review, April 2005 


THE ACCOUNTING REVIEW 
Vol. 80, No. 2 

2005 

pp. 423—440 


A Temporal Analysis of Quarterly 
Earnings Thresholds: Propensities and 
Valuation Consequences 


Lawrence D. Brown 
Georgia State University 


Marcus L. Caylor 
Georgia State University 


ABSTRACT: Applying a Burgstahler and Dichev (1997)/Degeorge et al. (1999) type 
methodology to quarterly data for the 1985-2002 time period, we show that, since the 
mid-1990s, but not before then, managers seek to avoid negative quarterly earnings 
surprises more than to avoid either quarterly losses or quarterly earnings decreases. 
Our findings suggest that the quarterly earnings threshold hlerarchy proposed by 
Degeorge et al. (1999) does not apply to recent years, and that managers' claim that 
avoiding quarterly earnings decreases Is the threshold they most seek to achleve 
(Graham et al. 2004) is Inconsistent with their actions. We provide an intuitively ap- 
peallng economic rationale for why the shift in threshold hierarchy occurred; slnce the 
mid-1990s, but not before then, Investors unambiguously rewarded (penalized) firms 
for reporting quarterly earnings meeting (missing) analysts' estimates more than they 
did for meeting (missing) the other two thresholds. We provide several explanations for 
why investors unambiguously reward firms for reporting quarterly eamings that meet 
or beat analysts' estimates more than for meeting the other two thresholds late (but 
not early) in our sample period: increased medla coverage given to analyst forecasts, 
more analyst following, more firms covered by analysts, and temporal Increases In both 
the accuracy and precision of analyst forecasts. 
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I. INTRODUCTION 

egeorge et al. (1999) examine the relative importance of three quarterly earnings 

thresholds managers seek to achieve, and conclude that managers order their earn- 

ings thresholds as follows: (1) avoid quarterly losses; (2) avoid quarterly earnings 
decreases, and (3) avoid negative quarterly earnings surprises. Dechow et al. (2003) ex- 
amine these same thresholds using annual data, and show that avoiding negative annual 
earnings surprises is the most important threshold managers seek to achieve in the last three 
years of their sample, 1999—2001, but not before then. In contrast to Dechow et al. (2003), 
Graham et al. (2004) find that managers maintain that avoiding quarterly earnings decreases 
is the most important quarterly earnings threshold that they seek to achieve.? 

These collective findings give rise to several important unanswered questions. Does the 
managerial threshold hierarchy shown by Dechow et al. (2003) for annual earnings thresh- 
olds pertain to quarterly earnings thresholds in recent years? OH so, then the hierarchy 
documented by Degeorge et al. (1999) in their influential study is no longer applicable, and 
what managers say their priorities are regarding achieving quarterly thresholds (the Graham 
et al. [2004] evidence) differs from what they actually do.’ If the threshold hierarchy pro- 
posed by Degeorge et al. (1999) is no longer applicable, then when and why did the shift 
occur? Alternatively, if the Degeorge et al. (1999) finding that avoiding quarterly earnings 
decreases is sought after relatively more vis-à-vis avoiding negative quarterly earnings sur- 
prises is still valid, then what managers say they do regarding setting their earnings thresh- 
old priorities (the Graham et al. [2004] findings) is consistent with what they actually do, 
but threshold findings based on annual data (the Dechow et al. [2003] findings) do not 
apply to quarterly data. 

Consistent with Degeorge et al. (1999), we show that managers focus less on avoiding 
negative quarterly earnings surprises than on avoiding either quarterly losses or earnings 
decreases early in our sample period (1985—1993). However, in direct contrast to Degeorge 
et al. (1999), we show that avoiding negative earnings surprises is not the least important 
quarterly threshold after 1993. Indeed, it is the most important threshold in every year, 
1996—2002. 

The major reasons managers claim they seek to meet or beat quarterly earnings bench- 
marks is to build credibility with capital markets and to maintain or increase their firms' 
stock prices (Graham et al. 2004). Consistent with this view, we assume that managers act 
in their own best interests, and that their large holdings of stock, options, and fear of losing 
their jobs if their firms' stock prices drop sharply if they fail to achieve thresholds induce 
them to look to the capital markets when they consider which threshold to focus upon most 


e 


Using annual data, Burgstahler and Dichev (1997) obtain the same hierarchical ordering for the first two thresh- 
olds. Burgstahler and Dichev (1997) do not examine analyst forecasts (the third threshold). Degeorge et al. 
(1999) conclude that avoiding a loss is the most important threshold to achieve based on their conditional 
analyses (i.e., making a threshold when another threshold is made/missed), but their unconditional results reveal 
that avoiding an earnings decrease is the most important threshold. More importantly for the purposes of our 
study, the Degeorge et al. (1999) finding that avoiding a negative earnings surprise is the least important threshold 
is robust to their use of both conditional and unconditional methodologies. 

? Their survey of 312 financial executives from public firms reveals the following rank ordering of achieving 
earnings benchmarks: (1) same quarter last year (84.3 percent agree); (2) analyst consensus estimate of the 
current quarter (68.8 percent agree); (3) reporting a profit this quarter (65 percent agree); and (4) previous quarter 
EPS (54.6 percent agree). 

* According to the Social Sciences Citation Index, Degeorge et al. (1999) was cited 48 times as of June 22, 2004. 

As a frame of reference, we compare cites to Degeorge et al. (1999) with cites to all 19 articles published in 

the first issue of 1999 by the two premier finance journals, Journal of Finance and Journal of Financial Eco- 

nomics (Trieschmann et al. 2000). Only one article (Barber and Lyon 1999) was cited more often than Degeorge 

et al. (1999), 
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intently. We analyze incremental valuation consequences of making (missing) thresholds, 
and document that since the mid-1990s, but not before then, investors unambiguously re- 
ward (penalize) firms more for reporting quarterly earnings that meet (miss) analysts' es- 
timates than for meeting (missing) the other two thresholds. Our findings are consistent 
with the notion that managers took a cue from the capital markets to re-order their quarterly 
earnings threshold hierarchy. 

After documenting a temporal shift in investor reaction to quarterly earnings, we ex- 
amine why tbe shift occurred. We motivate our analyses based on studies showing that 
valuation consequences of avoiding negative earnings surprises increases in the accuracy 
(Fried and Givoly 1982; Brown et al. 1987; Wiedman 1996) and precision of analyst fore- 
casts (Imhoff and Lobo 1992; Kinney et al. 2002). We show that, during our sample period, 
analyst quarterly earnings forecasts have become both more accurate and more precise. 

We also document temporal increases in analyst following, firms followed by analysts, 
and media attention paid to meeting or beating analyst earnings estimates, providing ad- 
ditional evidence for why investors increased their focus on analyst earnings forecasts. 

We show that our findings are robust to five sensitivity checks: (1) using a ratio metric 
suggested by Dechow et al. (2003) in lieu of the Burgstahler and Dichev (1997) measure; 
(2) un-adjusting earnings data for stock splits; (3) requiring firms to be in the sample for 
the entire time period; (4) using Compustat actuals in lieu of I/B/E/S actuals; and (5) 
omitting observations where I/B/E/S earnings differ from GAAP earnings. 

Our findings are important for several reasons. First, we provide guidance for which 
. quarterly earnings threshold, academics, practitioners, investors, and regulators should focus 
on most if they wish to examine earnings management to achieve quarterly earnings thresh- 
olds in recent years. Second, we illustrate that the quarterly earnings threshold managers 
claim to be most important (Graham et al. 2004) differs from what they seek to achieve 
most, suggesting that what managers say they do differs from what they actually do.? Third, 
we show that our findings based on quarterly data confirm those based on annual data 
(Dechow et al. 2003) when both studies use similar methodologies, increasing the validity 
of both studies. Fourth, we provide an intuitively appealing economic rationale for why 
threshold hierarchies have changed (i.e., managers took a cue from the capital markets, 
which have made it clear to them that in recent years avoiding negative surprises is more 
value-relevant than avoiding either losses or earnings decreases), increasing both our un- 
derstanding of why managers altered their threshold priorities and our confidence in the 
threshold results. Fifth, we provide several reasons for why investors paid more attention 
to earnings surprises in recent years, increasing our understanding of why the valuation 
implications of making/missing thresholds has shifted in favor of avoiding negative earn- 
ings surprises (away from avoiding losses and earnings decreases) in recent years. 


H. RESEARCH QUESTIONS 
Degeorge et al. (1999) conclude that avoiding a negative earnings surprise is the least 
important of the three quarterly earnings thresholds managers seek to achieve. In contrast, 
Dechow et al. (2003) show that avoiding negative earnings surprises is the most important 


* Consistent with our findings, Libby and Kinney (2000) and Ng and Tan (2003) conduct experimental studies of 
auditors, and focus on the “avoid negative earnings surprise" threshold, ignoring both the “avoid loss" and 
“avoid earnings decrease" thresholds. 

5 Graham et al. (2004, 9) expected to find analyst quarterly earnings forecasts to be the single most important 
threshold: “Before administering the survey, we expected the analyst consensus estimate to be the most important 
earnings benchmark. However, the responses indicate that same quarter last year's EPS seems to dominate the 
analyst consensus estimate." 
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of the three annual thresholds in recent years. Consistent with Degeorge et al. (1999), but 
in contrast to Dechow et al. (2003), Graham et al. (2004) show that managers maintain that 
avoiding quarterly earnings decreases is more important to them than avoiding negative 
quarterly earnings surprises. These conflicting findings make it unclear whether use of a 
Degeorge et al. (1999) type of methodology (used by Dechow et al. [2003] but not Graham 
et al.[2004]) would show that, in contrast to Degeorge et al., avoiding a negative earnings 
surprise is the most important quarterly earnings threshold in recent years. Our first re- 
search question is: 


RQ: Is avoiding negative quarterly earnings surprises the threshold that managers seek 
most to achieve in recent years vis-à-vis avoiding quarterly losses or quarterly 
earnings decreases? 


After showing that managers have made avoidance of negative quarterly earnings sur- 
prises their most sought-after threshold in recent years (but not early in our sample), we 
appeal to economic intuition to examine why this phenomenon has occurred. We assume 
that managers' wealth (e.g., stock holdings, option holdings, and job security) is closely 
tied to their firms' stock prices so that managers alter their threshold hierarchy if the relative 
incremental valuation consequences of achieving thresholds change. If avoiding negative 
earnings surprises is the threshold with the greatest incremental valuation consequences in 
recent years (but not earlier in our sample period), then managers will alter their threshold 
prioritization in recent years in favor of avoiding negative earnings surprises. Our second 
research question is: 


RQ.: Have the valuation consequences shifted to reward (penalize) managers more for 
avoiding (reporting) negative quarterly earnings surprises than for avoiding (re- 
porting) quarterly losses or quarterly earnings decreases in recent years? 


After showing that avoiding negative earnings surprises is the threshold with the 
greatest incremental valuation consequences in recent years, we examine why investors 
reward (penalize) firms more for avoiding (reporting) negative quarterly earnings surprises 
vis-à-vis avoiding (reporting) losses or earnings decreases in recent years. Investors place 
more weight on more accurate earnings forecasts (Fried and Givoly 1982; Brown et al. 
1987; Wiedman 1996). If analyst forecasts have increased in their relative accuracy vis-à- 
vis those of a seasonal random walk model of quarterly earnings, then investors should 
increase their focus on avoiding negative quarterly earnings surprises relative to that of 
avoiding quarterly earnings decreases.’ Our third research question is: 


$ Dechow et al. (2003) use a Degeorge et al. (1999) type methodology, albeit with annual data. The Degeorge et 
al. (1999) type methodology is similar to the one originated by Burgstahler and Dichev (1997). Both Degeorge 
et al. (1999) and Graham et al. (2004) use quarterly data but, unlike Degeorge et al., Graham et al. adopt a 
survey approach. Quarterly earnings thresholds may differ from annual earnings thresholds for at least two 
reasons: (1) quarterly earnings are un-audited while annual earnings are audited; and (2) quarterly earnings 
thresholds present managers with short-term targets creating a higher-pressure environment vis-à-vis annual 
earnings thresholds (Eccles et al. 2001). Managers may be better able to manage earnings upward shortly before 
quarterly vis-à-vis annual earnings announcements because quarterly reports are unaudited, making it easier for 
managers to avoid quarterly losses and quarterly earnings decreases. In contrast, managers may be less able to 
manage annual earnings shortly before they are announced so they focus more on managing analyst expectations, 
allowing them to avoid negative earnings surprises but not necessarily to avoid losses or earnings decreases. 

7 A seasonal random walk model is defined as earnings changes, the threshold Degeorge et al. (1999) and Graham 
et al. (2004) find to be more sought after than analyst-based earnings surprises. 
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RQ,: Has the relative accuracy of analyst quarterly earnings forecasts vis-à-vis a sea- 
sonal random walk model increased in our sample period? 


Investor reaction to earnings thresholds may have changed in favor of earnings surprises 
due to increased precision in analyst forecasts. Imhoff and Lobo (1992) and Kinney et al. 
(2002) show that investors place more weight on forecasts that are more precise, so we 
examine if the precision of analysts' earnings forecasts has increased over time. If analysts' 
earnings forecasts have become more precise over time, then investors should focus more 
on them later than earlier in our time period. Our final research question is: 


RQ,,: Has the precision of analysts’ quarterly earnings forecasts increased during our 
sample period? 


IIl. DATA AND METHODOLOGY 
Sample Selection 

We obtain actual quarterly earnings, analyst forecasts of quarterly earnings, stock prices, 
and earnings announcement dates from Thomson Financial I/B/E/S for 19 years, 1984— 
2002. We require earnings lagged by four quarters to measure our second threshold, earn- 
ings changes, so our study period is 18 years, 1985-2002. We calculate three-day market- 
adjusted cumulative abnormal returns around quarterly earnings announcements and obtain 
SIC codes using CRSP data. 

We use the single most recent forecast made by the timeliest analyst(s) prior to the 
earnings announcement date as our analyst forecast? We define earnings changes as re- 
ported earnings in quarter t minus those in quarter z—4, and earnings surprises as reported 
earnings in quarter t minus the analyst's forecast of earnings in quarter t. We deflate earnings 
levels, earnings changes and earnings surprises by beginning of quarter t stock price.” All 
firm-quarter observations require reported earnings in both quarter t and quarter t—4, an 
analyst forecast of quarter t earnings, stock price at the beginning of quarter t, and returns 
data for the three days, —1 to +1, where day 0 is the earnings announcement day. 

Consistent with Burgstahler and Dichev (1997), we delete firms with SIC codes 4400— 
5000 (regulated industries) and 6000—6500 (financial institutions). To calculate analyst fore- 
cast dispersion, we take the standard deviation of analyst forecasts prior to the earnings 
announcement date, deflated by stock price (Kinney et al. 2002). For this analysis, we 
require firms to be followed by at least three analysts. To remove the effects of outliers, 
we delete three-day market-adjusted cumulative abnormal returns in the two extreme per- 
centiles.'? Our sample for the main analysis consists of 111,485 firm-quarter observations. 


8 When there is more than one analyst forecast on the most recent day, we take the mean of the distribution of 
analyst forecasts. We use the most recent forecast in lieu of the consensus forecast for several reasons. First, it 
is common in the literature to use the most recent forecast as a measure of expectations (Brown 2001; Bartov 
et al. 2002). Second, managers tend to pre-announce all the bad news but about half the good news (Soffer et 
al. 2000), so using the most recent forecast mitigates the confounding effects of earnings pre-announcements. 
Third, the most recent forecast is more closely related than is the mean consensus forecast to the stock price 
reaction to earnings announcements (Brown and Kim 1991) so it is a superior measure of the market’s earnings 
expectation. 

? Burgstahler and Dichev (1997) divided net income by beginning of year market value of common equity from 
Compustat, where income approximately equals earnings per share times the number of common shares, and 
market value of common equity approximately equals price per share times the number of common shares. We 
obtain qualitatively similar results when we deflate earnings levels and earnings surprises by beginning of quas 
t stock price and earnings changes by beginning of quarter t—4 stock price. 

10 We obtain qualitatively similar results when we do not delete the two extreme percentiles. 
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Methodology 


Our primary analysis uses a statistical test developed by Burgstahler and Dichev (1997) 
to examine managerial propensity to achieve thresholds.!! For each interval, we calculate 
the actual number of observations minus the expected number of observations, divided by 
the estimated standard deviation of this difference.? We compute a standardized difference 
of “the missing data" just to the left of zero for each threshold. For example, for the “avoid 
loss" threshold, we compare very small negative earnings levels (i.e., losses) with very 
small positive earnings levels (i.e., profits) to determine how many very small losses have 
been "shifted rightward” to become very small profits." Consistent with Burgstahler and 
Dichev (1997), we consider negative standardized differences with absolute magnitude of 
2.33 or more to be threshold achievement (p-value — 0.01 in a normalized distribution, 
one-tailed test), and the threshold with the largest negative standardized difference to be 
the most pervasive form of threshold achievement. We estimate an OLS regression of 
standardized differences on year to assess temporal changes in the managerial propensity 
to achieve thresholds.’ 

To examine valuation consequences of achieving thresholds, we regress market-adjusted 
cumulative abnormal returns (CAR) in the three days around quarterly earnings announce- 
ments on the eight mutually exclusive and collectively exhaustive categories of making/ 
missing thresholds dummy variables plus interaction variables for each of these dummy 
variables with unexpected earnings:!é 


CAR = XCategory dummy + B,*(TI- T2- T3-)*UE + B,*(TI* T2- T3-)*UE 
+ B,F(TI- T2* T3-)*UE + B,*(TI- T2- T3*)*UE 
+ B.*(T1* T2* T3-)*UE + B,*(TI* T2- T3** UE 
+ B,*(TI- T2* T3*)*UE + B,F(TI* T2* T3*Y*UE + u. (1) 


U To validate our calculation of the Burgstahler and Dichev metric, we replicated Burgstahler and Dichev (1997) 
using annual data and their time period and obtained nearly identical results. 

32 “The expected number of observations in an interval is assumed to be the average of the two immediately adjacent 
intervals. The variance is approximately Np, (1 — p) + (1/4) N (p. + p.) (1 — Pi-1_~ Di.) where N is the 
total number of observations and p, is the probability an observation falls into interval i. Burgstahler and Dichev 
(1997) provide details regarding the first two thresholds; Burgstahler and Eames (2003) provide details for the 
third threshold. We report results using an interval width of .0025, but our general inferences are the same with 
interval widths of .005 or .01. 

1? We include zeros as achieving thresholds in our primary analyses. However, zeros are biased toward on-target 
earnings announcements more than those that actually occur (Baber and Kang 2002) so we omit zeros in the 
"Split-Adjusted Data" section below when we conduct sensitivity analyses. 

^ The Burgstahler and Dichev test statistic represents the deficiency of expected observations in the bin just to 
the left of zero, indicating that observations have been shifted to the bin just to the right of (and including) 
zero. Although we do not report Burgstahler and Dichev test statistics just to the right of zero, untabulated 
results confirm that positive Burgstahler and Dichev test statistics for the bin just to the right of (and including) 
zero have increased significantly over time for the avoid negative earnings surprise threshold. 

15 The Burgstahler and Dichev test statistic increases in sample size (see Burgstahler and Eames [2003, 292, fn 
14]) so, after calculating the distribution across bins using the exact sample, we set all sample sizes equal to 
1,000 for all years and thresholds and use the actual proportion of observations in a bin to determine the number 
of observations given the sample size was 1,000. This grosses down the Burgstahler and Dichev test statistic, 
providing a conservative bias since our threshold/year sample sizes always exceed 1,000. More specifically, the 
avoid a loss threshold has mean (median) annual sample sizes of 6,130 (6,283) with a range of 1,952 to 9,461. 
The avoid an earnings decrease threshold has mean (median) annual sample sizes of 6,114 (6,266) with a range 
of 1,925 to 9,458. The avoid negative surprise threshold has mean (median) annual sample sizes of 6,161 (6,298) 
with a range of 1,962 to 9,483. We get similar results with sample sizes of 2,000 or 3,000. More specifically, 
the t-values of temporal regressions, R?s, and point where thresholds cross over are identical for sample sizes 
of 1,000, 2,000, or 3,000. 

16 For simplicity, we omit the i and z subscripts for the dependent and independent variables. 
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T1, T2, and T3 designate the avoid losses, earnings decreases, and negative surprise thresh- 
olds, respectively; the superscripts, —/+, indicate if a threshold has been missed or made; 
XCategory dummy are categories of making/missing thresholds dummy variables; and UE 
is unexpected earnings, defined as actual earnings minus the consensus analyst forecast 
(deflated by beginning of quarter z stock price). Each category is represented by a dummy 
variable (1 if the firm-quarter z observation falls into that category, and 0 otherwise). For 
example, if a firm makes zero (all three) thresholds, the 77~ 72^ 73^ (T1* T2* T3*) dummy 
variable equals 1. The eight categorical dummy variables collectively represent the intercept 
of our model (i.e., the eight dummy vectors sum to the unit vector). Each dummy variable 
is interacted with the magnitude of unexpected earnings deflated by stock price (an earnings 
response coefficient). | 

To provide evidence on the incremental valuation consequences of achieving thresholds, 
we estimate an OLS regression of Equation (1) each year and obtain dummy and interaction 
coefficients. We calculate the average valuation consequence of making/missing a threshold 
by summing the value of the dummy variable for that threshold/year and the product of 
the appropriate interaction term coefficient and the mean level of deflated unexpected earn- 
ings for that threshold combination category for that year. We determine incremental val- 
uation consequences by taking differences of these valuation consequences. For example, 
to obtain the positive incremental valuation consequences for making the avoid loss thresh- 
old when no threshold has been made, we subtract ((dummy variable for TI T2- T3 ) 
+ B,*Mean UE) from ((dummy variable for TI* 72^ T3^) + 8,*Mean UE). To obtain the 
negative incremental valuation consequences for missing the avoid loss threshold when all 
three thresholds have been made, we subtract ((dummy variable for TI* 72* T3") 
+ B,*Mean UE) from ((dummy variable for 77~ 72* T3*) + B;*Mean UE). 


IV. MAIN RESULTS 
Temporal Evidence on Threshold Hierarchy 


Figure 1 shows temporal changes in the three managerial thresholds. Consistent with 
similar findings in Brown (2001), Bartov et al. (2002), Lopez and Rees (2002), Matsumoto 
(2002), and Dechow et al. (2003), we find the propensity to avoid negative surprises has 
increased over time (slope = —1.218, t-value = —13.91).? Consistent with Burgstahler and 
Eames (2003), we find the propensities to avoid losses (slope = —0.0361, t-value = —1.63) 
and earnings decreases are unchanged over time (slope = —0.1024, t-value = —1.62). A 
Chow test reveals that the slope of the propensity to avoid negative surprises exceeds that 
of the slope of the propensities to avoid either losses or earnings decreases at the 1 percent 
significance level or better. 

Consistent with Degeorge et al. (1999), managers seek to avoid losses or earnings 
decreases more than they do to avoid negative earnings surprises in each of the first nine 
years of our sample period, 1985—1993. However, in contrast to Degeorge et al. (1999), 
managers seek to avoid negative earnings surprises more than to avoid losses or earnings 


The 111,485 observations fall into the eight categories as follows (ordered as in the CAR equation with percent 
of total in parentheses): 8,367 (7.51 percent), 13,483 (12.09 percent), 1,923 (1.73 percent), 4,614 (4.14 percent), 
14,697 (13.18 percent), 14,136 (12.68 percent), 3,080 (2.76 percent), and 51,185 (45.91 percent). 

18 Negative standardized differences indicate a propensity to achieve a threshold so a negative coefficient indicates 
an increased temporal propensity to do so. The interpretation of positive standardized differences in the early 
years is the well-known optimism documented in the literature for analyst forecasts in earlier time periods 
(O'Brien 1988; Das et al. 1998). 
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FIGURE 1 
Temporal Analysis:of Quarterly Earnings Thresholds 
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Temporal analysis of standardized differences. Yearly standardized differences are computed by first 
determining the distribution of deflated earnings levels, changes, and surprises in bins with .0025 
widths. After setting sample size equal to 1000, standardized differences are computed as the 
difference between the actual and expected number of observations in a bin. Standardized 
differences for the bin just to the left of zero are plotted. A standardized difference with a negative 
value that has an absolute value of 2.33 or greater indicates threshold achievement. Slopes are 
computed using ordinary least squares regressions of standardized differences on year. For example, 
the slope of the avoid negative earnings surprise threshold is —1.218 with a t-statistic of —13.91 (p- 
value = 0.001), indicating that the propensity to avoid negative earnings surprises has increased 
significantly over time. This model provides 92 percent explanatory power. 


decreases in each of the last seven years of our study, 1996—2002.!? The results are strongly 
consistent with our first research question. Consistent with the findings of Dechow et al. 
(2003), but not with those of Graham et al. (2004), managers have a greater propensity to 
avoid negative quarterly earnings surprises in recent years than they do to avoid either 
losses or earnings decreases. 

The Figure 1 evidence that avoiding earnings surprises is the most (least) sought-after 
threshold late (early) in our sample period suggests a temporal increase in management of 
analysts’ quarterly earnings estimates.” Bartov et al. (2002) show that expectations man- 
agement in the second half of their sample exceeded that in the first half of their sample. 


1? Beaver et al. (2003) argue that the discontinuity overestimates the actual amount of earnings management for 
avoiding both losses and earnings decreases. If such an overestimation pertains to our findings, the cross-over 
point may actually occur earlier in time than we suggest, but our major point remains: avoiding negative earnings 

is the most sought-after threshold in recent years. An alternative explanation for our finding is that 
analyst forecasts have become timelier and thus less optimistic in recent years. To address this issue, we regress 
forecast timeliness (its proximity to earnings announcement date) on year. Finding no significant temporal trend, 
we reject this alternative interpretation of our results. 

20 Avoiding losses and avoiding earnings decreases can be achieved via earnings management, but not by expec- 
tations management. In contrast, avoiding negative earnings surprises can be accomplished both by earnings 
management and expectations management. 
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To add validity to their study and our Figure 1 findings, we use their methodology to 
calculate a proxy for the aggregate level of expectations management for each year in our 
sample period. For each year, we compute the difference in proportions of negative forecast 
error cases that end with a zero or positive surprise minus the proportion of positive or 
zero forecast error cases that end with a negative surprise (Bartov et al. 2002, H.).?! For 
simplicity, we do not provide a figure, but we correlate the Bartov et al. (2002) statistic for 
each year with absolute values of the three numbers in Figure 1 for the same year: 75, 
T3—T2, and T3—TI. The statistically significant Pearson correlations of 0.99, 0.97, and 
0.98, respectively, provide strong evidence that expectations management explains why 
management of earnings surprises has increased vis-à-vis management of earnings levels 
and earnings changes.^ 


Temporal Evidence on the Valuation Consequences of Making Thresholds. 


Table 1 provides results of our yearly regressions. The intercept terms show the valu- 
ation consequences of missing (making) all three thresholds are negative (positive) every 
year, adding validity to our procedures. To see how avoiding negative surprises has become 
the threshold with the largest incremental valuation consequences in recent years, consider 
the coefficient estimates for our calculation of incremental valuation consequences for 
achieving one threshold (T,, L, L,, L,, Bi 85, B4, and B4). If we only consider intercept and 
slope coefficients significant at the 5 percent (one-tailed) level or better in our calculations 
of incremental valuation consequences of making one threshold, then we can compare 
incremental valuation consequences of avoiding negative earnings surprises with avoiding 
losses and earnings decreases for the years 1994—2000. With the exception of 1995 in 
which there is a higher but insignificant coefficient for avoiding a negative earnings surprise, 
we find that for every other year, avoiding a negative earnings surprise unambiguously has 
the highest incremental valuation consequences. If we only consider significant intercept 
and slope coefficients when calculating the incremental valuation consequences for missing 
one threshold, then we find that during the 1993—2002 time period, not avoiding a negative 
earnings surprise has the most negative consequences and all of the avoid negative earnings 


?! Forecast error is defined as forecast revision plus earnings surprise. Consistent with Bartov et al. (2002), we 
define revision as the last analyst forecast for quarter t before quarter t's earnings announcement minus the first 
analyst forecast for quarter ¢ after the previous quarter’s earnings announcement. Similar to our definition of the 
analyst forecast in our primary analysis, if there is more than one forecast on the day of the first forecast, then 
we take an average of the forecasts. To be consistent with Bartov et al. (2002), we require at least 20 trading 
days in between these two forecasts. Due to these additional data restrictions, our sample size is reduced to 
70,530 observations. Although we use a different sample of firms due to our imposing additional constraints, 
we validate our calculation of Bartov et als (2002) metric by computing proportions for the 1985-1993 and 
1994—1997 subperiods. The proportion for 1985-1993 of 13.53 percent compares with 9.33 percent in Bartov 
et al. (2002) for their 1983—1993 subperiod. The proportion for 1994—1997 of 35.20 percent compares with 
31.37 percent for their same 1994—1997 subperiod. The proportion for 1998—2002 (not included in Bartov et 
al. [2002]) is 50.98 percent. - 

22 We also examined the relative frequency of negative surprises versus the relative frequency of negative forecast 
errors to see if the relative frequency of the latter exceeds the former, another test of expectations management 
(Bartov et al. 2002, Ho. We correlated this alternative Bartov et al. (2002) statistic for each year with absolute 
values of the three numbers in Figure 1 for the same year: 73, T3—72, and T3— T1. The Pearson correlations 
of 0.86, 0.87, and 0.85 suggest that expectations management explains why management of earnings surprises 
has increased vis-à-vis management of earnings levels and earnings changes. Once again we validate our metric 
by comparing cur subperiod results to Bartov et al.’s (2002) subperiod results. The proportion for 1985-1993 
is 8.70 percent, which mirrors closely that found for Bartov et al.’s (2002) 1983—1993 subperiod of 8.48 percent. 
The proportion for 1994—1997 is 13.76 percent, which is nearly the same as 13.66 percent for their 1994—1997 
subperiod. We further calculate the proportion for 1998-2002 (not included in Bartov et al. [2002]) to be 19.26 


percent. 
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surprise coefficients are significant, whereas the coefficients for the other two thresholds 
are insignificant for some years.” 

Figure 2, Panel A presents a temporal analysis of the incremental valuation conse- 
quences of making one threshold. It is evident that before 1993, avoiding a negative surprise 
is not the obvious threshold to achieve first. Avoiding a negative surprise has the largest 
incremental valuation consequence in four of the first eight years of our sample period, 
1985-1992. However, for all of the last ten years of our sample period (1993—2002), 
avoiding negative surprises has the largest incremental valuation consequence, making it 
clear that it is the threshold managers should seek to achieve most. 

Figure 2, Panel B presents a temporal analysis of the incremental valuation conse- 
quences of missing one threshold. In the first four years of our sample period, 1985-1988, 
the valuation consequences of the threshold that is best to avoid missing is unclear. How- 
ever, the valuation consequence of the threshold that is best to avoid missing is clear 
thereafter. In all of the last 14 years of our sample period, 1989—2002, reporting a negative 
surprise has the largest negative incremental valuation consequence. 

The evidence is consistent with our second research question. It is clear that the positive 
(negative) valuation consequences of making (missing) thresholds are greatest for the third 
threshold in recent years but not in earlier years. Our results are consistent with the notion 
that managers shifted their threshold hierarchy (RQ,) in response to changes in the incre- 
mental valuation implications of making/missing thresholds (RQ.). 


Shifts in Relative Accuracy and Precision of Analyst Forecasts 


Before examining the temporal changes in the relative accuracy and precision of analyst 
forecasts (RQ, and RQ,), we examine several other potential explanations for why investors 
may have shifted their focus in favor of avoiding negative earnings surprises. We examine 
if there was a temporal increase in the prevalence of analyst forecasts during our time 
period, and we summarize below (but do not tabulate) three types of related evidence. 


(1) There is a temporal increase in the number of analysts in the I/B/E/S database 
over our time period (slope = 172.5, t-value = 14.19). 

(2) There is a temporal increase in the number of firms in the I/B/E/S database over 
our time period (slope — 175.4, t-value — 5.75). 

(3) There is a temporal increase in media attention (slope — 912.87, t-value — 12.69) 
paid to analyst forecasts over our time period.24 


It is evident that the pervasiveness of analyst forecasts exploded during our time frame, 
increasing access to, and likely enhancing investor focus upon, analyst forecasts.” 

To address our third research question, we examine whether analysts' earnings forecasts 
have become more accurate vis-à-vis a seasonal random walk model of quarterly earnings 
over our sample period. For this purpose, we conduct a temporal analysis of analyst forecast 
accuracy (the basis of our third threshold) vis-à-vis the accuracy of a seasonal time series 
model of quarterly earnings (the basis of our second threshold). Figure 3, Panel A plots 


72 We present all years in Figure 2 for presentation purposes, noting that we may be comparing insignificant 
coefficients for some years. However, as shown above, our main inference of the increased capital market 
importance of avoiding a negative earnings surprise is robust to considering only significant coefficients in the 
computation of incremental valuation consequences. 

^ We conducted a keyword search of analyst forecasts using the FACTIVA database. We performed keyword 
searches for analyst estimates, meet or beat earnings expectations, and consensus forecast and found similar 
results. 

25 The increased pervasiveness of analyst forecasts probably increased investor attention on analysts, while in- 
creased investor attention on analysts probably increased the pervasiveness of analyst forecasts. 


The Accounting Review, April 2005 


434 Brown and Caylor 


FIGURE 2 
Jncremental Valuation Consequences of Quarterly Earnings Thresholds 
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Panel B: Missing One Threshold 
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We estimate regression Equation (1) by year to obtain yearly coefficients for all eight mutually 
exclusive, collectively exhaustive categories of achieving thresholds. We then calculate the valuation 
consequence of making/missing a threshold by summing the value of the dummy variable for that 
threshold/year and the product of the appropriate interaction term coefficient and the mean level of 
deflated unexpected earnings, where UE is defined as actual earnings minus consensus analyst 
earnings forecast, for that threshold combination category for that year. We determine incremental 
valuation consequences by differencing the appropriate valuation consequences. 


(continued on next page) 
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FIGURE 2 (continued) 


Panel A presents a temporal analysis of the incremental (average) total valuation consequences of 
making one threshold. For example, for making the avoid loss threshold when no threshold has 
been made, we subtract ((dummy variable for T7 T2- 73 ) + B,*Mean UE) from ((dummy 
variable for 71* T2- T3) + BR *Mean UE). 

Panel B presents a temporal analysis of the incremental (average) total valuation consequences of 
missing one threshold. For example, for missing the avoid loss threshold when all three thresholds 
have been made, we subtract ((dummy variable for TIt 72* T3*) + p,*Mean UE) from ((dummy 
variable for TI- T2* T3*) + B, *Mean UE). 

We compute the slopes using ordinary least squares regressions of valuation consequences on year. 
For example, in Panel A, the slope of the avoid earnings decrease threshold is 0.05 percent with a 
t-statistic of 1.47. The model which regresses the avoid earnings decrease incrementa] valuation 
consequences on the integers 1 to 18 for years — 1985 to 2002 provides 6 percent explanatory 
power. 


the median of the distribution of absolute values of the ratio of analysts' earnings surprises 
to earnings changes." The adjusted R? of the regression of this ratio on time is 92 percent, 
and the t-value on the time variable is —14.22, providing strong evidence of a temporal 
increase in the relative accuracy of analyst forecasts vis-d-vis that of a seasonal random 
walk time-series model. 

To address our fourth research question, we examine whether the precision of analysts' 
quarterly earnings forecasts has increased during our sample period. Figure 3, Panel B plots 
the median forecast dispersion over time.” The adjusted R? is 87 percent and the t-value 
on the time variable is —10.64, providing strong evidence of a temporal increase in the 
precision of analysts' quarterly earnings forecasts. Our Figure 3 findings are consistent with 
our third and fourth research questions. 


V. SENSITIVITY ANALYSES 
We now address five potential validity threats: (1) use of the procedure suggested by 
Burgstahler and Dichev (1997); (2) use of split-adjusted data, (3) changing composition of 
sample firms during our time period; (4) using I/B/E/S actuals to measure earnings levels 
and changes; and (5) using non-GAAP earnings. For simplicity, we do not tabulate results 
using procedures that mitigate these validity threats, but we summarize our findings below. 


Ratio Statistic 


In our primary analysis, we computed a test statistic setting sample sizes equal to 1,000 
(see footnote 15) to avoid using test statistics that are sensitive to sample size. Because this 
procedure ignores actual sample sizes, we employ a nonparametric statistic used by Dechow 
et al. (2003), whose magnitude is unaffected by sample size. We compute the ratio as the 
actual number of observations in the bin just to the right of (and including) zero divided 
by the actual number of observations in the bin just to the left of zero. We obtain quali- 
tatively similar results as in Figure 1 except that the avoid negative surprise threshold 
becomes the most sought after threshold two years later (in 1998). Similar to Figure 1, 


?$ For this analysis, we delete any observation where the earnings surprise or earnings change is equal to 0. 
27 To ensure a reasonable number of observations are available to measure dispersion, we require that at least three 
analysts follow the firm. 
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FIGURE 3 
Relative Accuracy and Forecast Dispersion over Time 


Panel À: Relative Accuracy of the Forecast Metrics over Time 
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Panel A presents a temporal analysis of the median ratio of earnings surprises to earnings changes. 
Sample size is 93,829 observations. For each year we compute the median of the distribution of 
absolute values of the ratio of earnings surprises to earnings changes for all firm-quarters in a 
particular year. 

Panel B presents a temporal analysis of forecast dispersion. Sample size is 82,473. For each year 
we compute the median forecast dispersion for all firm-quarters in a particular year. Consistent with 
Kinney et al. (2002), we define dispersion of analyst forecasts as the standard deviation of forecasts 
as of the last consensus forecast update before the quarterly earnings announcement, scaled by 
price per share. We also require that at least three individual analysts make a forecast at that date. 
We compute the slopes using ordinary least squares regressions of the median ratio of earnings 
surprises to earnings changes or the median forecast dispersion on year. We report the slope 
(t-value) and adjusted R2 of these regressions. 
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avoiding negative earnings surprises is the only threshold with a significant temporal in- 
crease (slope = 0.251, t-value = 11.32). 


Split-Adjusted Data 

Baber and Kang (2002) show that errors induced by rounding split-adjusted data may 
alter conclusions about how investors react to earnings that exactly meet analyst expecta- 
tions. We mitigate this potential validity threat in two different ways. First, we delete all 
observations with earnings levels, earnings changes, or earnings forecast errors equal to 
zero cents per share, reducing our sample size to 93,273. Second, we use the I/B/E/S 
procedure and file (with adjustment factors) for un-adjusting for the stock splits. Both our 
threshold and valuation consequence results are qualitatively unchanged using either pro- 
cedure, indicating that our results are not due to our failure to adequately account for stock 
splits. 


Same Firms in Sample during Entire Time Period 


Our results may arise from a shift in composition of sample firms during our time 
period. To mitigate this potential validity threat, we restrict our analyses to firms that are 
in our sample during our entire time period, reducing our sample size to 24,016. Our 
threshold and valuation consequence results are qualitatively similar when we limit our 
sample to these firms, suggesting that our results are not due to temporal shifts in sample 
composition. 


Compustat versus I/B/E/S Actuals 


I/B/E/S is primarily a source of earnings forecast data, not a vendor of actual earnings 
numbers so there is likely to be less measurement error in computing the third threshold, 
especially in recent years when the analyst data providing business has become so com- 
petitive, when First Call came on the scene in the early 1990s. To examine the seriousness 
of this validity threat, we re-compute the first two thresholds using quarterly Compustat 
earnings data (Compustat data item 9 or 19, depending on whether the I/B/E/S forecast 
data is reported on a diluted or primary (basic) basis, respectively) in lieu of I/B/E/S 
earnings data, deflated by Compustat stock price, reducing our sample size to 89,628.” We 
obtain qualitatively similar threshold avoidance and valuation consequence results using 
Compustat data to compute our first two thresholds, revealing that our results are not 
attributable to use of I/B/E/S actuals to compute our first two thresholds. 


Pro Forma versus GAAP Earnings 


The change in relative accuracy we document may result from analysts predicting pro- 
forma earnings rather than GAAP earnings more over time (Bradshaw and Sloan 2002; 
Bhattacharya et al. 2004). Use of non-GAAP earnings (i.e., I/B/E/S actuals) may “stack 
the deck” in favor of analyst forecasts and actuals (i.e., our third threshold), especially in 
recent years when the use of pro forma has become so pervasive. To mitigate this validity 
threat, we examine the relative accuracy of analyst forecasts vis-à-vis that of a seasonal 
random walk model over time for the subset of firms where I/B/E/S actuals equal S&P 


?* Philbrick and Ricks (1991) compare Value Line and I/B/E/S actuals with Compustat earnings numbers. Finding 
more concordance of the Value Line actuals with the Compustat actuals, they conclude that Value Line provides 
better actuals. Ramnath et al. (2005) show that Value Line does not provide better actuals than I/B/E/S in 
recent years. 

29 We continue to use I/B/E/S data to compute our third threshold because we do not wish to mix actuals and 
forecasts from different sources (Philbrick and Ricks 1991). 
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GAAP earnings (Compustat data item 9 or 19, depending on whether the I/B/E/S forecast 
data is reported on a diluted or primary (basic) basis, respectively). We find a significant 
temporal increase in relative accuracy even for this subset of firms. The adjusted R? of the 
regression of this ratio on time is 72 percent and the t-value on the time variable is —6.66. 
It is evident that our results are not driven by use of pro forma earnings numbers in lieu 
of GAAP earnings. 


VI. CONCLUSIONS AND IMPLICATIONS 

Degeorge et al. (1999) use pooled temporal, cross-sectional data, and conclude that 
managers seek to avoid quarterly losses and earnings decreases more than to avoid negative 
earnings surprises. À 2003 survey by Graham et al. (2004) finds that managers attest that 
they seek to avoid quarterly earnings decreases more than to avoid negative quarterly earn- 
ings surprises. In contrast to these survey results, Dechow et al. (2003) apply a Degeorge 
et al. (1999) type methodology to annual data, and find that managers are more prone to 
avoid negative annual earnings surprises than to avoid either losses or earnings decreases 
in the last three years of their study, 1999—2001. These conflicting results lead us to re- 
examine threshold hierarchies applying a Degeorge et al. (1999) type methodology to quar- 
terly data. 

We confirm Degeorge et al's (1999) results earlier in our sample period, but we find 
precisely the opposite later in our sample period. Our findings reveal that the quarterly 
hierarchy proposed by Degeorge et al. (1999) is no longer valid, and that the threshold 
managers maintain they seek to achieve most (Graham et al. 2004) is inconsistent with the 
threshold that they actually seek to achieve most. We appeal to economic intuition to 
examine why managers changed their threshold prioritization. We assume that managers 
are wealth-maximizers who recognize that their wealth (i.e., stocks, options, and human 
capital) is adversely impacted when their firms' stock prices drop if they fail to achieve 
thresholds. Our finding that capital markets unambiguously reward (sanction) firms more 
for making (missing) analyst forecasts late, but not early, in our time period, is consistent 
with the notion that managers took a cue from the capital markets to alter their threshold 
prioritization in favor of avoiding negative quarterly earnings surprises. 

We examine why investors changed their focus in favor of analyst forecasts. We find 
a significant. increase over our sample period in the number of analysts, the number of 
firms followed by analysts, the amount of media attention paid to analyst forecasts, the 
relative accuracy of analyst forecasts vis-à-vis that of a seasonal random walk model, and 
the precision of analyst forecasts. All of these factors may have induced investors to pay 
more attention to analysts' forecasts later than earlier in our time period. 

Our findings are robust to five sensitivity checks: (1) using a ratio metric suggested by 
Dechow et al. (2003) in lieu of a Burgstahler and Dichev (1997) measure; (2) un-adjusting 
earnings data for stock splits; (3) requiring firms to be in the sample for the entire time 
period; (4) using Compustat actuals in lieu of I/B/E/S actuals; and (5) omitting observa- 
tions where I/B/E/S earnings differ from GAAP earnings. These results increase our con- 
fidence in our findings of temporal shifts in managers' threshold hierarchy and the incre- 
menta] valuation consequences thereof that favor avoidance of negative quarterly earnings 
surprises. 

We close with two caveats. First, our findings are subject to the validity of our meth- 
odologies. It is possible that the survey results of Graham et al. (2004) are more valid than 
the Burgstahler and Dichev (1997)/Degeorge et al. (1999)/Dechow et al. (2003) type meth- 
odologies to examine thresholds. However, it is at least as plausible that managers report 
in surveys that they focus on one thing (avoid quarterly earnings decreases) while they 
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actually focus more on something else (avoid quarterly negative surprises). Managers may 
do this unwittingly or they may report something in a survey that they believe the surveyor 
wants them to state. Second, we do not definitively show that managers altered their thresh- 
old prioritization because investors changed the thresholds they reward (sanction) the most. 
Similarly, we do not definitively show that investors altered the threshold they reward 
(sanction) most for making (missing) because of the proliferation of analyst forecasts, media 
attention paid to analysts, and/or the increase in the accuracy and precision of analyst 
forecasts. i 
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ABSTRACT: This paper examines the link between managers’ equity incentives— 
arising from stock-based compensation and stock ownership—and eamings manage- 
ment. We hypothesize that managers with high equity incentives are more Ilkely to sell 
shares in the future and this motivates these managers to engage in earnings man- 
agement to increase the value of the shares to be sold. Using stock-based compen- 
sation and stock ownership data over the 1993-2000 time period, we document that 
managers with high equity Incentives seil more shares in subsequent periods. As ex- 
pected, we find that managers with high equity incentives are more likely to report 
eamings that meet or just beat analysts' forecasts. We also find that managers with 
consistently high equity incentives are less likely to report large positive earnings sur- 
prises. This finding Is consistent with the wealth of these managers being more sen- 
sitive to future stock performance, which leads to increased reserving of current earn- 
ings to avoid future eamings disappointments. Collectively, our results indicate that 
equity Incentives lead to incentives for earnings management. 
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I. INTRODUCTION 

egulators and investors have raised concerns that certain management incentives 
R< lead to earnings management, reducing the informativeness of financial re- 

porting and contributing to recent corporate scandals (Levitt 1998; Knowledge at 
Wharton 2003). Of particular recent concern is that stock-based compensation might induce 
managers to increase the short-term stock price through earnings management. For example, 
Michael Jensen argues that stock-based compensation and other equity incentive elements 
can encourage managers to increase short-term stock prices in order to benefit from sub- 
sequently selling shares of their own firms’ stock (The Economist 2002).! Indeed, these 
concerns have contributed to new calls for the expensing of stock options, other regulatory 
constraints on compensation practices, and improved corporate governance (FASB 2004; 
Morgenson 2003; Conference Board 2002). However, there is little empirical evidence on 
the relation between equity incentives and earnings management. The purpose of this paper 
is to examine this relation, thereby providing evidence that can be used to corroborate or 
refute allegations of equity-incentive-related earnings management. 

Modern corporations employ various mechanisms to remedy the adverse consequences 
arising from the separation of ownership and control (Jensen and Meckling 1976). En- 
couraging managerial ownership is an example of such mechanisms. Theoretically, as man- 
agers own more shares, they are more likely to act in the interests of shareholders. One 
way to increase the level of managerial ownership is to award managers options or shares 
of stock (Core and Guay 1999). Prior research examines the association of managerial 
ownership and stock-based compensation with future firm performance, and finds evidence 
consistent with the incentive-alignment effect of these equity incentive elements (Lambert 
and Larcker 1987; Morck et al. 1988; Hanlon et al. 2003, among others). However, at the 
same time, these elements can lead managers to focus on short-term stock prices, thereby 
leading to incentives for earnings management. 

The incentives for earnings management arise from managers' diversifying the in- 
creased risk associated with ownership or stock-based compensation. For example, Ofek 
and Yermack (2000) examine the dynamics of ownership and stock-based compensation, 
and find that when managers are awarded stock-based compensation, they tend to sell shares 
they already own for risk diversification reasons. Managers ere likely to continue selling 
shares in the future if the risk exposure due to equity incentives is, or is expected to be, 
above the level that managers are willing to bear. The risk exposure can increase in the 
future because when stock prices increase, managerial wealth becomes more concentrated 
in the stock and managers' option holdings become riskier (i.e., more sensitive to stock 
prices). 

Because of this selling, a manager's wealth is sensitive to the short-term stock price, 
which can motivate managers with high equity incentives to Increase the short-term stock 
price. Given that the capital market uses current earnings to predict future earnings when 
pricing firm equity, these managers are expected to use their accounting discretion to man- 
age earnings in order to keep the short-term stock price high (Stein 1989). 

In this paper, we explicitly examine the links from equity incentives to earnings man- 
agement. In particular, we examine the following equity incentive elements: option grants, 
unexercisable options, exercisable options, restricted stock grants, and stock ownership. 
Because the positive relation between equity incentives and managers' future sales of their 
own firms' stock underpins the earnings management argument, we first document the 





! Dechow and Skinner (2000) also express concerns that when stock market valuations are high, managers have 
incentives to maintain or even increase those valuations, probably via earnings management. 
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existence of this relation. We measure a manager's future net sales as the difference between 
open market sales and the sum of open market purchases and the shares received from 
exercising options. The results indicate that, as expected, managers with high equity incen- 
tives have significantly higher levels of net sales in the year after earnings announcements.? 

Next, we investigate the association between equity incentives and earnings manage- 
ment by examining the extent to which managers report earnings that meet or just beat 
analysts' forecasts. As expected, we find a significantly higher incidence of meeting or just 
` beating analysts’ forecasts for firms with higher managerial equity incentives. Based on 
analyses that control for firm performance and other potential confounds, we conclude that 
our results are more consistent with earnings management induced by equity incentives as 
opposed to better firm performance attendant to incentive alignment benefits or to equity 
incentive elements correlating with other fixm characteristics. 

We then examine whether managers who predictably engage in earnings management 
sell more shares after actual earnings management. We find that managers with high equity 
incentives sell more after meeting or beating analysts' forecasts than after missing analysts' 
forecasts. In contrast, we do not find such evidence for managers with low equity incentives. 
These results are consistent with high equity incentive managers increasing sales after 
earnings management. 

We also conduct additional analyses to enrich our results on earnings management. 
First, equity incentives can be recurring; managers with high equity incentives in the current 
period are more likely to have high equity incentives in the future. Thus, managers with 
high equity incentives can benefit from smoothing earnings. By reserving in good years, 
these managers can more easily meet or beat analysts' forecasts in future periods. Consistent 
with this argument, we find that high equity incentive managers are less likely to report 
large positive earnings surprises compared to those with low equity incentives. The evidence 
is particularly strong for firms with more persistent equity incentives. Second, we find that 
high equity incentive managers, on average, report more income-increasing abnormal ac- 
cruals, especially for managers with less persistent equity incentives—those who are less 
concerned with the reversal of accruals. We also find that managers sell more shares after 
taking income-increasing abnormal accruals. Last, our findings are robust to alternative 
measurements of key variables, the inclusion of other control variables, and alternative 
regression specifications. 

Overall, consistent with regulators’ and investors’ concerns about equity incentives, we 
find that high equity incentive managers are more likely to engage in earnings management, 
relative to managers with low equity incentives. This evidence contributes to the compen- 
sation literature by extending research on compensation-related opportunistic behavior. By 
documenting one potential consequence of stock-based compensation and ownership, this 
paper should be of interest to boards of directors contemplating compensation contracts for 
managers. While stock-based compensation and ownership can yield positive incentive 
alignment effects, optimal compensation contracts should reflect the earnings management 
consequences arising from these equity incentives. 

This paper also contributes to the earnings management literature. Although earnings 
management has received considerable attention in the accounting literature, little is known 
about the earnings management incentives arising from stock-based compensation or own- 
ership. This paper therefore extends research on compensation-related earnings management 


? In our main analyses, we focus on CEOs because they are the ultimate decision makers. Results based on all 
managers with data available are similar. 
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beyond that related to bonus contracts as documented in Healy (1985) and Warfield et al. 
(1995). 

Our paper is related to other concurrent studies of the capital market pressure on man- 
agers due to equity incentives. Ke (2001) finds that managers' equity incentives are posi- 
tively correlated with the duration of a string of quarterly earnings increases and Skinner 
and Myers (2000) find that firms tend to manage earnings to maintain a string of earnings 
increases. Beneish and Vargus (2002) find that income-increasing accruals, when accom- 
panied with insider sales, are of lower quality, consistent with the notion that managers 
might manage earnings upward and then sell their own shares. We extend these studies by 
explicitly linking equity incentives to financial reporting attributes that reflect earnings 
management (meeting or just beating analysts’ forecasts, avoiding large positive earnings 
surprises, and recognizing abnormal accruals) and by providing direct evidence of managers 
trading in response to such earnings management.? 

The remainder of the paper is organized as follows. In the next section, we develop 
our hypothesis concerning equity incentives and earning management. In Section III, we 
describe the sample and data with primary results presented in Section IV. Section V dis- 
cusses further analyses that provide additional insights and ensure the robustness of our 
results; Section VI concludes with a summary. 


IIl. HYPOTHESIS DEVELOPMENT 
The theoretical linkages for our study are illustrated in Figure 1. As indicated, the link 
between equity incentives and managers' future trading is important because such trading 
motivates managers to care about short-term stock prices, resulting in earnings management 
incentives. In this section, we first discuss the relation between equity incentives and man- 
agers' future trading of their own firms' stock, and then present our hypothesis on earnings 
management. 


Equity Incentives and Their Relation to Future Manager Trading 

Managerial ownership is an important mechanism to align managers' incentives with 
those of shareholders (Jensen and Meckling 1976; Morck et al. 1988). One way to increase 
ownership is through stock-based compensation. When a manager is granted options, the 
options usually are not exercisable until three or four vears later. When options become 
exercisable (i.e., vested), managers can choose to hold exercisable options or to exercise 
the options and hold shares instead. (To finance option exercises, managers generally sell 
the shares received from exercising options.) Managers are also awarded restricted stock, 
which, like option grants, does not vest (1.e., cannot be traded) until after three or four 
years. Another way to obtain shares is through open market purchases, which is common 
in early career stages (Core and Larcker 2002). 

These various elements and their interrelationships are summarized in Figure 2. As 
indicated, at any point in time, such as at the fiscal-year-end, managers might have various 
forms of equity-based holdings: unexercisable options, exercisable options, and stock own- 
ership. Due to these equity-based holdings, managers' wealth is sensitive to their firms' 
stock prices and managers therefore bear the idiosyncratic risk of the firm. 


* Concurrent research by McVay et al. (2004) examines insider trading following meeting and beating analysts' 
forecasts. Unlike this paper, McVay et al. (2004) do not examine ex ante incentives for earnings management 
resulting from stock-based compensation or ownership. 
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FIGURE 1 
Theoretical Links 


Equity incentives Earnings management 
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Future manager trading 
Proxies: 


Net sales of shares in the future 





This figure depicts the theoretical constructs and the links between them. For each theoretical 
construct, we list empirical proxies used in the analyses. Tests of Link 1 comprise our main 
analyses, in which we investigate whether high equity incentive managers are more likely to 
manage earnings. While we primarily rely on prior research to establish Link 3, we test Link 2 to 
verify that high equity incentive managers sell more in the future. 


The equilibrium risk level results from a balance of two opposing forces. From an 
incentive-alignment perspective, equity incentives help motivate managers to work in share- 
holders' interests, thereby reducing agency costs. However, from the perspective of risk 
diversification, risk-averse managers want to reduce their exposure to the idiosyncratic risk 
of the firm. Consequently, managers want to sell shares if the risk exposure level is higher 
than the equilibrium level.* Even if the current risk exposure is at the equilibrium level, it 
can increase beyond this level in the future due to changes in economic circumstances. 

For example, if stock prices increase, then managers’ wealth becomes more concen- 
trated in a single risky asset (1.e., their firms’ stock). Thus, the exposure of managers’ wealth 
to the idiosyncratic risk increases. Furthermore, based on option valuation theory (such as 
Black and Scholes 1973), the sensitivity of the value of unexercisable options to stock price 
(i.e., the hedge ratio or the delta of the option) increases with stock price.? Under general 
capital asset pricing assumptions, the expected return of a firm's stock is positive; that is, 
stock prices are expected to increase in normal economic circumstances. As a result, man- 
agers' risk exposure due to equity incentives predictably increases in the future. If it in- 
creases beyond the equilibrium level, then managers might sell shares they already own to 
diversify the risk. 


* If managers' equity incentive levels are lower than the equilibrium level, then they can purchase shares from 
the open market. Some firms explicitly specify the target stock ownership for CEOs to facilitate incentive 
alignment (Core and Larcker 2002). | 

+ One can formally derive this result from examining the Black and Scholes (1973) formula. The intuition is as 
follows. The hedge ratio of an option equals the probability that the option is in the money when it is exercisable. 
(The hedge ratio of an in-the-money exercisable option is 1.) Since the probability for an option to be in the 
money when it is exercisable increases with stock prices, its hedge ratio increases with stock prices. 
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In summary, managers with higher equity incentives are more likely to sell shares (Link 
2 in Figure 1).and their wealth is therefore sensitive to short-term stock prices. This sen- 
sitivity can lead to incentives to increase short-term prices and the value of the shares 
managers are going to sell. To this end, they can make value-increasing production, fi- 
nancing, and investment decisions, and/or engage in earnings management. The latter is 
the focus of this study. 


Equity Incentives and Earnings Management 


Managers who sell shares in the future have earnings management incentives if two 
conditions hold: (1) the capital markets rely on reported earnings in forming beliefs about 
future earnings so that earnings management can affect stock prices, and (2) managers can 
take advantage of the increased stock prices. With respect to the first condition, Stein (1989, 
657) argues that “the stock market uses earnings to make a rational forecast of firm value— 
higher earnings today will be correlated with higher earnings in the future." Extant ac- 
counting research shows that stock prices generally react to earnings announcements in the 
direction of earnings surprises, with analysts' forecasts as the proxy for earnings expecta- 
tions (O'Brien 1988; Brown and Kim 1991; among others). Thus, managers can use their 
accounting discretion to affect reported earnings and stock prices if the capital markets have 
difficulty in detecting earnings management. 

If stock prices increase with earnings management, then managers gain more from 
selling shares in the future than without earnings management. Thus, condition (2) is sat- 
isfied, unless litigation or reputation concerns deter managers from taking advantage of the 
increased stock price. The evidence in prior research is consistent with managers being 
able to benefit from insider trading. For example, Seyhun (1998) and Lakonishok and Lee 
(2001), among others, find that insiders are generally better informed and earn abnormal 
returns. With respect to accounting information disclosed through management forecasts or 
earnings announcements, Penman (1982), Sivakumar and Waymire (1994), and Noe (1999) 
find that managers sell more after good news than after bad news. Summers and Sweeney 
(1998) and Beneish (1999) find that managers of firms accused of accounting fraud sell 
their own shares before these firms are formally subject to SEC enforcement actions. Sim- 
ilarly, Beneish and Vargus (2002) find evidence consistent with managers selling more after 
market participants overprice income-increasing accruals. 

Thus, extant research supports the link from insider trading to earnings management 
(Link 3 in Figure 1). This link, combined with the above-discussed link from equity in- 
centives to future trading, suggests that high equity incentive managers are more likely to 
engage in earnings management, relative to low equity incentive managers (Link 1 in Figure 
1). This prediction is consistent with the conclusion in Stein (1989): managers who care 
about short-term stock prices are expected to manage earnings to increase short-term stock 


Š While the efficient market assumption posits that managers cannot fool investors with earnings management, 
extant empirical evidence suggests otherwise. For example, Kasznik (1999) finds that earnings management can 
help companies avoid market penalties associated with missing earnings targets. Sloan (1996) and Xie (2001), 
among others, suggest that the capital markets misprice accruals, especially abnormal accruals, consistent with 
earnings management influencing stock prices. See Fields et al. (2002) for additional discussion of the pricing 
effects of earnings management. As discussed below, incentives for earnings management exist in our setting 
even if on average earnings management cannot affect stock prices. 


The Accounting Review, April 2005 


448 Cheng and Warfield 


prices and such behavior increases with the sensitivity of managers” utility to current stock 
prices.’ 

Last, earnings management incentives exist not only when investors are naive—fixating 
on reported earnings, but also when investors are sophisticated—adjusting for expected 
earnings management when pricing equity. Stein (1989) shows that when investors are 
sophisticated, in equilibrium, investors rationally expect managers to engage in earnings 
management, which they do. In other words, if the market has rational expectations of 
earnings management behavior but cannot completely undo earnings management of indi- 
vidual firms, it is optimal for market participants to price the reported earnings by adjusting 
for the impact of earnings management. Given investors' pricing behavior, managers find 
it optimal to manage earnings. 

In sum, as depicted in Figure 1, managers with high equity incentives are likely to sell 
shares. Such trading behavior motivates managers to care about short-term stock prices and 
to manage earnings. Formally, we hypothesize that: 


H1: The incidence of earnings management is positively associated with equity 
incentives. 


In this paper, we focus on one particular proxy for earnings management—meeting or 
just beating analysts’ forecasts. Recent studies of earnings management suggest that the 
disproportionate likelihood of meeting or just beating analysts' forecasts is an important 
manifestation of earnings management (Degeorge et al. 1999; Burgstahler and Eames 2003; 
among others). We focus on this proxy because it has more direct market consequences 
relative to other proxies, such as abnormal accruals. Prior research has documented the 
reward for meeting analysts' forecasts (Bartov et al. 2002; Kasznik and McNichols 2002) 
and the negative market consequences of missing analvsts' forecasts (Skinner and Sloan 
2002). To further corroborate and enrich our earnings management results, in Section V 
we also analyze the likelihood of large positive earnings surprises to investigate the exis- 
tence of earnings smoothing and the recognition of abnormal accruals. 


Ill. SAMPLE AND DATA 

The initial sample includes all firm-years with data on CEOs' stock-based compensation 
and ownership available from the Standard & Poor's ExecuComp database for the period 
1993—2000.5 We exclude financial institutions (SICs between 6000 and 6999) and utilities 
(SICs between 4400 and 5000) because managers in these regulated industries might have 
different motivations to manage earnings (Burgstahler and Eames 2003, among others). The 
final sample used for specific analyses varies due to additional data requirements and is 
discussed in the empirical analysis sections. 

The timing of variable measurement is chosen to be consistent with our theoretical 
development. As summarized in Figure 3, all equity incentive variables are measured during 
or at the end of fiscal year t. Since we hypothesize that these variables are correlated with 
future manager trading and earnings management, earnings management is measured based 


? Incentives to manage earnings are reduced by the associated costs, such as those arising from audit qualifications, 
SEC enforcement sanctions, or management reputation effects. In this paper, we first assume that the costs of 
earnings management are not correlated with equity incentives and thus do not affect our prediction. In Section 
V, we discuss the possibility that certain costs of earnings management—its impact on the value of managers’ 
equity holdings in the future—vary with equity incentives and explicitly test the implications of this possibility. 

* Although ExecuComp started coverage in 1992, the coverage is lower in 1992 than in other years. Thus, 
including 1992 data, while yielding similar results, might introduce a sample selection bias. 
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FIGURE 3 
Timing of Variable Measurement 


Fiscal year t, One year after the earnings 
when option grants and announcement for fiscal year t, when 
stock grants are measured managers” trading is measured 
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Fiscal-year-end of year t, Earnings announcement 
when unexercisable options, for fiscal year t, when 


exercisable options, and earnings Management 
ownership are measured proxies are measured 





This figure describes the timeline when the three sets of variables of interest are measured. In brief, 
equity incentives are measured before earnings management proxies (earnings surprises and 
abnormal accruals), which are based on the information disclosed at earnings announcements. 
Managers’ trading is measured after earnings announcements. 


on the information disclosed at the earnings announcement for fiscal year f, generally from 
one to four months after the fiscal year-end. All trading variables are measured after earn- 
ings announcements (subsequent to the measurement of equity incentive and earnings man- 
agement variables). 


Equity Incentives 

We consider five equity incentive elements: option grants in the current period, unex- 
ercisable options (excluding option grants in the current period), exercisable options, re- 
stricted stock grants, and stock ownership. We deflate these measures (in shares) by total 
outstanding shares of the firm. Because the potential benefit of increasing short-term stock 
prices, if any, is shared by all shareholders (regardless of whether non-CEO shareholders 
realize it), the benefit enjoyed by CEOs is thus proportional to the ratio of equity incentive 
measures in shares to total outstanding shares.? 

Panel A of Table 1 reports the descriptive statistics on CEO equity incentive measures. 
Because we assume that CEOs can sel] already held shares to diversify risk due to option 
or stock grants, we restrict our analyses to firm-years with ownership higher than option 
and stock grants.!? As reported in the table, option grants are on average 0.163 percent of 
outstanding shares. Option grants are part of the unexercisable options at the end of the 
year; the remaining unexercisable options average 0.254 percent. Exercisable options are 
on average 0.637 percent. 


? Baker and Hall (2004) analyze the conditions under which alternative proxies for managers’ equity incentives 
are valid. In brief, equity incentive proxies measured in proportion to firm value are most appropriate when 
managers' actions affect firm value on a dollar-to-dollar basis (e.g., purchasing a corporate jet). Thus, if the 
earnings management activities affect firm value on a dollar-to-dollar basis, instead of on a proportional basis 
(e.g., a corporate reorganization), the ratio measure we use is appropriate. Otherwise, a measure based on the 
magnitude of equity incentives is appropriate. We examine alternative measurements of equity incentives as part 
of our sensitivity tests reported later. 

10 This restriction increases the power of the tests but reduces the sample by about 25 percent. Relaxing this 
restriction does not affect the inferences. 
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TABLE 1 
Descriptive Statistics on Equity Incentives, CEOs’ Trading, and Earnings Surprises 


Panel A: Descriptive Statistics on CEOs’ Equity Incentives 


The descriptive statistics are based on 9,472 firm-years with ownership higher than option and 
stock grants in the period 1993—2000. Equity incentive measures (in shares) are scaled by total 
outstanding shares of the firm. 





Mean Std. Median 
Option grants (%) 0.163 0.309 0.051 
Unexercisable options (excluding option grants) (%) 0.254 0.429 0.078 
Exercisable options (%) 0.637 0.932 0.295 
Restricted stock grants (%)* 0.008 0.031 0.000 
Ownership (%) 4.184 7.602 0.825 


Panel B: Spearman Correlations between Equity Incentive Measures 


The correlations are based on 9,472 firm-years with ownership higher than option and stock grants 
in the period 1993-2000. Equity incentive measures (in shares) are scaled by total outstanding 
shares of the firm. All correlations are significant at the 0.05 level or better. 


Option Unexercisable Options Exercisable 
Grants (excluding option grants) Options 
Unexercisable options (excluding option grants) 0.210 
Exercisable options 0.366 0.432 
Ownership 0.163 0.034 0.101 


Panel C: CEOs' Net Sales in the Year after the Earnings Announcement 


The descriptive statistics are based on all firm-years in the period 1993—2000 with both equity 
incentives and net sales in a specific period after the earnings announcement. 





As Percentage of Market 
In Million Dollars Value 
n Mean Std. Median Mean Std. Median 
lst six-month 2,830 2.358 11.115 0.065 0.225 1.325 0.008 
2nd six-month 2,411 2.055 9.596 0.009 0.249 1.448 0.004 


Panel D: Number of Observations in Each Earnings Surprise Cell 


Earnings surprise per share is measured as the difference between actual earnings and the latest 
consensus forecast before earnings announcement. The sample includes 4,301 firm-years with 
equity incentives and earnings surprises in the period 1993—2000. 





Earnings Surprise (cents) n Earnings Surprise (cents) n 
G 1,057 
—1 379 1 746 
—2 174 2 448 
—3 94 3 271 
—4 71 4 176 
Less than —5 352 Greater than 5 533 


* Due to its small magnitude, we do not analyze restricted stock grants separately from ownership in the 
following analyses. 
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Restricted stock grants appear to play a small role in stock-based compensation. While 
63.2 percent of the sample report option grants, only 18.3 percent have restricted stock 
grants. The mean restricted stock grant is only 0.008 percent. Due to this small magnitude, 
stock grants can hardly have an economically significant impact on future trading or earn- 
ings management; thus for the sake of brevity, in the analyses we do not separate them 
from ownership, which averages 4.184 percent. These descriptive statistics are similar to 
those reported in Ofek and Yermack (2000). 

Panel B of Table 1 reports correlations between equity incentive measures. The three 
option measures are positively correlated with each other and are also positively correlated 
with ownership. 


CEOs’ Trading 

CEOs’ trading data are collected from the SEC ownership reporting system data file 
(Forms 3, 4, 5). CEOs and other insiders are required to report their transactions by the 
tenth day of the calendar month after the trading month. As in prior research of insider 
trading (e.g., Lakonishok and Lee 2001), we examine three primary types of transactions: 
open market purchases, purchase of shares through the exercise of options, and open market 
sales (which include sales of shares from exercised options). Because the database assigns 
different names to CEOs, we regard individuals with titles like “Chairman of Board" (42.4 
percent), “CEO” (12.6 percent), and “President” (45.0 percent) as CEOs, as done in Ke 
et al. (2003). 

We examine CEOs’ trading in the year after the earnings announcement for fiscal year 
t. CEOs’ trading is measured as net sales:!! 


Net sales = Open market sales — (Open market purchases + Options exercised). 


Since the frequency of CEO trading is generally low, using past trading data as the deflator, 
as in Lakonishok and Lee (2001), is not feasible. Instead, we scale net sales in dollars by 
the firm's market value at the end of fiscal year t (before net sales are measured). Using 
unscaled measures, either logarithm transformations or decile rankings, does not affect our 
inferences. 

Panel C of Table 1 reports descriptive statistics on CEOs' net sales. To allow for 
variation in CEO trading within the year, we separately investigate net sales in the first and 
second six-month periods. Net sales, on average, are approximately $2 million in a six- 
month period, representing about 0.2 percent of the firm's market value. These descriptive 
statistics are consistent with those reported in Lakonishok and Lee (2001) and suggest that 
CEOs are net sellers on average. 


Earnings Surprises 

Earnings surprises are calculated as the difference between actual earnings and analysts’ 
forecasts, both of which are measured on a per-share basis and are collected from First 
Call. To better capture the market's expectation, we use the most recent consensus forecasts 
within the three-month period before the earnings announcement. Ás reported in Panel D 
of Table 1 and consistent with prior research, there are more firm-years meeting or just 


!! Excluding options exercised from the calculation of net sales does not affect the inferences. 
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beating analysts’ forecasts than just missing analysts’ forecasts.!* Of 4,301 firm-years with 
equity incentives and earnings surprises in the period 1993—2000, 25 percent have zero 
earnings surprises, i.e., meeting analysts' forecasts, 17 percent beat analysts' forecasts by 
one cent, but less than 9 percent miss analysts' forecasts by one cent. 


IV. PRIMARY EMPIRICAL RESULTS 

As discussed in Section II, the link between equity incentives and future CEOs' trading 
is critical to our hypothesis that the incidence of earnings management is positively related 
with equity incentives. Since prior studies do not provide direct evidence on this link, in 
this section, we first report results on the relation between equity incentives and future 
trading. We then report results on the impact of equity incentives on the probability of 
meeting or just beating analysts' forecasts. Last, we investigate whether managers sell more 
after earnings management. 


Equity Incentives and Future Trading 


We examine the relation between equity incentives (Eq Incent) and future trading 
(NetSale) using the following regression: 


NetSale,,., = Yo + BEq Incent;, + yiSize,, + y;Growth,, + y,Ret;, + y,AOCF,, 
+ Eu (1) 


where j = 1 or 2, denotes the jth six-month period after the earnings announcement. See 
Appendix A for variable measurement. Our focus is whether the coefficients on Eq. Incent 
are positive. In the regression, we also include several control variables based on prior 
research of insider trading.'^ Lakonishok and Lee (2001) find that insiders in large firms 
sell more shares, although in a smaller proportion to market value. Rozeff and Zaman (1998) 
find that insiders in growth firms sell more shares. Prior returns are included to control for 
contrarian trading behavior by insiders; insiders tend to sell more shares after high stock 
returns (Lakonishok and Lee 2001). Change in operating cash flow is included to control 
for any potential impact of firm performance on insider trading. We use a cash flow measure 
because it is less subject to earnings management compared to a performance measure 
based on earnings. The correlations reported in Appendix B indicate that these control 
variables are significantly correlated with each other and thus it is important to control for 
them at the same time. 

The regression results are reported in Table 2. Consistent with our prediction, option 
grants, other unexercisable options, and ownership have significantly positive coefficients, 
suggesting that CEOs with high equity incentives are more likely to sell in the next year. 
Since option grants and other unexercisable options are similarly correlated with future 


12 Baber and Kang (2002) find that analysts’ forecast data in I/B/E/S or First Call are subject to split adjustment 
bias; because these databases adjust stock splits retrospectively and split-adjusted data are rounded to the nearest 
cent, the probability of zero or one cent forecast errors might be overstated. While the impact of this split- 
adjustment bias on our results is unclear ex ante, we replicate our analyses on observations without stock splits 
in the sample period (61 percent of the sample). The results are essentially the same. 

13 While Ofek and Yermack (2000) study the relation between stock/option grants and contemporaneous insider 
sales, they neither examine future insider sales nor do they study option/stock ownership. 

^ ]n all analyses, we also include year dummies to control for potential year-specific effects. Throughout the paper, 
outliers, defined as observations with an absolute value of the R-student measure greater than 3, are excluded. 
None of the regressions are subject to multicollinearity based on the diagnostics in Belsley et al. (1980). 
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TABLE 2 
Equity Incentives and Net Sales in the Future 


This table reports the results from regressing CEOs’ net sales (96) in the two six-month periods 
after earnings announcements on equity incentive measures (96) and control variables: 


NetSale,,,, = Yo + BEq Incent,, + y,Size,, + y,Growth,, + y3Ret,, + ysAOCF,, + up C) 


where j — 1 or 2, denotes the jth six-month period after the earnings announcement. See Appendix 
A for variable measurement. Year dummies are included to control for year fixed effects. The 
regressions are estimated on all firm-years with data available in the sample period 1993—2000. The 
p-values in parentheses are based on one-sided t-tests. 


Predicted 
Signs 1st Six-Month 2nd Six-Month 
Equity incentives 
Option grants + 0.112 0.045 
(0.001) (0.014) 
Unexercisable options + 0.091 0.101 
(excluding option grants) (0.001) (0.001) 
Ownership + 0.016 0.014 
(0.001) (0.001) 
Exercisable options + —0.013 0.015 
(0.862) (0.159) 
Contro] variables 
Size E —0.027 —0.035 
(0.001) (0.001) 
Growth _ — 0.229 —0.191 
(0.001) (0.001) 
Stock returns + 0.135 0.099 
(0.001) (0.001) 
Change in cash flow + —0.058 —0.174 
(0.712) (0.916) 
n 2,723 2,316 
Adj. R? 0.110 0.068 


trading, we combine these two variables together in subsequent analyses for the sake of 
brevity. This modification does not affect our inferences. 

In contrast, exercisable options have an insignificant impact on CEOs’ trading. This 
result might seem unexpected, given the apparent similarity between unexercisable and 
exercisable options. However, unexercisable options differ from exercisable options in the 
hedge ratio, which captures CEOs’ risk exposure due to option holdings. To illustrate, when 
options are first granted, the hedge ratio is between 0 and 1. If stock prices increase after 
option grants, then the hedge ratio of unexercisable options increases. To diversify the 
increased risk, the manager can sell shares already owned. By the time options are exer- 
cisable and in the money, the hedge ratio is approximately 1 and no longer increases with 
stock prices. Thus, there is no need for further risk diversification. As a result, while 


15 For example, suppose that the hedge ratio of options is 0.6 when granted (the sample average reported in Jensen 
and Murphy [1990]). Managers might sell shares to diversify the increase in risk (0.6 x number of options) 
when they are awarded options and sell more shares [(1 — 0.6) X number of options] before these options 
become exercisable. 


The Accounting Review, April 2005 


454 Cheng and Warfield 


unexercisable options are positively related to future insider sales, exercisable options are 
not. 

One might argue that although exercisable options likely have a constant hedge ratio, 
they expose CEOs to the firm's idiosyncratic risk, just as ownership does. Unlike exercisable 
options, ownership is a more predominant equity incentive element (a much larger propor- 
tion of CEOs' wealth, as indicated in Panel A of Table 1) and is thus a better proxy for 
CEOs' overexposure to idiosyncratic risk. Nevertheless, we admit that there are other pos- 
sible explanations for the insignificant results on exercisable options, a topic we leave for 
future research. Importantly for our subsequent analyses, the measures that predict future 
CEO trading should be correlated with earnings management, but those that do not predict 
future trading should not be correlated with earnings management. Our earnings manage- 
ment results are consistent with these predictions. 

The control variable results are generally consistent with prior research. Size exhibits 
a negative association with net sales, suggesting that sales by CEOs of large firms are 
smaller in proportion to market value than those of small firms. Book-to-market also has a 
negative coefficient, suggesting that CEOs of growth firms sell more shares. The positive 
coefficient on stock returns is consistent with CEOs selling more shares after price in- 
creases. Change in cash flow, however, has an insignificant coefficient, although it becomes 
significantly positive when growth and stock returns are not included. 

In sum, we find that CEOs with high unexercisable options or high stock ownership 
are more likely to sell in the future. This result suggests that equity incentives can cause 
managers' utility to be sensitive to short-term stock prices: if the short-term stock price is 
higher (lower), then the value of the shares to be sold, part of the management wealth, 
is higher (lower). The next section investigates whether this sensitivity induces a positive 
relation between equity incentives and earnings management, as hypothesized in H1. 


Equity Incentives and Earnings Management 
Tests of H1 are based on logit regressions of the probability of earnings surprises being 
in a certain range on equity incentive measures (Eq Incent) and various control variables: 


Prob(Surprise,, € ®,) = logit(yo, + B,Eq Incent,, + y, ,Size,, + y; ,Growth,, , 
+ y445ales Growth,, + *y,,NOA,, , + ys, Shares,, 
+ y«4Litigation;, + y; ImplicitClaim,, 
+ yg,Num  Estimate,, + *y9,CV. AF;, + *y19,Down Rev,, 
+ 6. (2) 


Surprise, , is earnings surprise per share, as defined above; ® could be one of the following 
two sets of earnings surprises: (1) negative earnings surprises; (2) zero or one cent; equity 
incentive measures include unexercisable options, exercisable options, and ownership. See 
Appendix A for variable measurement. Hypothesis 1 predicts positive coefficients on equity 
incentive measures when investigating the probability of meeting or just beating analysts' 
forecasts (i.e., earnings surprises being zero or one cent). Relatedly, H1 implies a negative 
relation between equity incentives and the probability of missing analysts’ forecasts (i.e., 
negative earnings surprises). 

Following prior research, we include various control variables that might be correlated 
with earnings surprises and/or equity incentives. Equity incentives are generally higher for 
small firms or growth firms, and high-growth firms might be more likely to meet or beat 
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analysts' forecasts to avoid the torpedo effect associated with missing analysts' forecasts, 
as documented in Skinner and Sloan (2002). Thus, we control for firm size and growth 
(proxied by the book-to-market ratio and sales growth). Beginning-of-period net operat- 
ing assets are included because Barton and Simko (2002) find that firms with high net 
operating assets in the previous year are less likely to meet or just beat analysts' forecasts, 
arguably due to a lack of flexibility in managing earnings upward. The number of outstand- 
ing shares is included because firms with more outstanding shares have smaller earnings 
per share and are thus more likely to meet or just beat analysts' forecasts of earnings per 
share (Barton and Simko 2002). 

. Prior research suggests other incentives to avoid negative earnings surprises. Ali and 
Kallapur (2001) and Francis et al. (1994) find that price declines due to negative earnings 
surprises can lead to litigation liability. Similarly, Bowen et al. (1995) argue that negative 
earnings surprises can have negative publicity effects, which can adversely affect the im- 
plicit claims between a firm and its stakeholders. Thus, the incentive to avoid negative 
earnings surprises increases with litigation risk and implicit claims. We include an industry- 
based litigation dummy variable and an implicit claim variable (proxied by labor intensity) 
as controls. 

We also include three variables to control for analyst forecast attributes: number of 
analysts, dispersion of individual forecasts, and a downward revision dummy. The incentives 
to meet analysts' forecasts can be stronger if more analysts are following the firm or if 
there is greater consensus among analysts (Payne and Robb 2000). As shown in Matsumoto 
(2002), it is easier to meet or beat analysts' forecasts that have been guided downward, so 
we include a dummy variable to indicate whether analysts’ forecasts are revised downward 
in the three months before earnings announcements. 

As reported in Appendix B, the univariate correlations between the probability of meet- 
ing or just beating analysts’ forecasts and independent variables, especially the equity in- 
centive measures, are generally consistent with our predictions. While many correlations 
between independent variables are significant, none is substantial enough to lead to 
multicollinearity. 

Table 3 reports the logit regression results. As expected, both unexercisable options 
and ownership exhibit significant positive effects on the probability of meeting or just 
beating analysts' forecasts. For example, a one standard deviation increase in unexercisable 
options increases by 16.3 percent the odds of meeting or just beating analysts' forecasts, 
while a one standard deviation increase in ownership increases by 30.5 percent the odds of 
meeting or just beating analysts’ forecasts.'é Also as expected, unexercisable options and 
ownership are negatively associated with the probability of missing analysts' forecasts. 

Although exercisable options are negatively correlated with the probability of missing 
analysts’ forecasts, they are not correlated with the probability of meeting or just beat- 
ing analysts' forecasts. This result is consistent with the trading evidence reported in the 
last section and with the argument that equity incentives lead to earnings management via 
future trading. Unlike unexercisable options and ownership, exercisable options are not 
correlated with future trading and are thus not correlated with earnings management in this 
test or our other analyses. 


16 The odds of an event are the ratio of the probability that this event occurs to the probability that this event does 
not occur. For example, the odds of meeting or just beating analysts' forecasts are the ratio of the probability 
that a firm meets or just beats analysts' forecasts to the probability of the firm having earnings surprises other 
than zero or one cent. The odds are a positive function of the probability that the event occurs. We interpret the 
results based on the change in the odds because this measure is a function of the coefficient in the logit 
regressions only. In contrast, calculating the change in the probability requires a reference point. 
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TABLE 3 
Equity Incentives and Meeting or Just Beating Analysts’ Forecasts 


The table reports the change in the odds of firms (1) missing analysts' forecasts, or (2) meeting or 
just beating analysts' forecasts by one cent, due to a change of one standard deviation of a 
continuous variable or from 0 to 1 for a dummy variable, and the p-value (in parentheses) of the 
corresponding coefficient in the logit regressions. 


Prob(Surprise,, € ®,) = logit(yo, + B, Eg. Incent;, + y, ,Size,, + y; ,Growth;, , 
+ y3,Sales_Growth,, + y, ,NOÀ,, , ys, Shares;, 
+ yo Litigation, + "y, ,InplicitClaim,, + *y4, Num, Estimate,, 
+ Yo CV. AF,, + Y10,Dewn_Rev,, + 0) (2) 
Surprise, , is earnings surprises per share; b could be one of the following two sets of earnings 
surprises: (1) negative earnings surprises; (2) zero or one cent. See Appendix À for variable 


measurement. Year dummies are included to control for year fixed effects. The logit regressions are 
based on 4,301 firm-years with all required data in the period 1993—2000. 


Meeting or Just Beating 
Negative Earnings Surprises Analysts? Forecasts 
o = (—, 0) (1) Ó = [$0, $0.01] (2) 


Predicted Change in the Odds Predicted Change in the Odds 
Signs (p-value) Signs (p-value) 


Equity incentives 


Unexercisable options E —0.159 + 0.163 
(0.001) (0.001) 
Ownership = —0.224 + 0.305 
: (0.001) (0.001) 
Exercisable options = — 0.098 + —0.043 
(0.010) (0.869) 
Control variables 
Size ? —0.352 ? 0.032 
(0.001) (0.607) 
Growth J — 0.090 = —0.243 
(0.985) (0.001) 
Sales_Growth = —0.194 + —0.033 
(0.001) (0.824) 
Net operating assets (NOA) + 0.100 = 0. 
(0.010) (0.737) 
Shares = 0.016 + 0.309 
(0.599) (0.001) 
Litigation = —0.083 + 0.589 
(0.160) (0.001) 
Implicit Claim = —0.063 + 0.187 
(0.052) (0.001) 
Num. Estimate = 70,121 + 0.242 
(0.012) (0.001) 
CV.AF + 0.496 = —0.733 
(0.001) (0.001) 
Down_Rev =< 7.939 + 0.980 
(0.999) (0.406) 
Pseudo R2 0.195 0.180 
Likelihood ratio 603 612 
(p-value) (0.001) (0.001) 


The Accounting Review, April 2005 


Equity Incentives and Earnings Management 451 


Results for the control variables are generally consistent with those reported in prior 
research. Large firms are less likely to miss analysts' forecasts and growth firms are more 
likely to meet or just beat analysts' forecasts. Consistent with Barton and Simko (2002), 
firms with large net operating assets in the previous year are more likely to miss analysts' 
forecasts. Firms with more outstanding shares, firms in high-litigation industries, and firms 
with more implicit claims are more likely to meet or just beat analysts' forecasts. Also, 
firms are more likely to meet or just beat analysts' forecasts when more analysts provide 
forecasts or when there is greater consensus among forecasts. In contrast to the expectation 
management argument, firms with downward revisions are more likely to miss analysts' 
forecasts than those without downward revisions, but this result is consistent with an- 
alysts' under-reaction to bad news, as documented in Abarbanell and Bernard (1992), be- 
cause the downward revision can be due to underlying economic changes. 

To further investigate whether expectation management contributes to the reported re- 
sults, we replicate the analyses using “old” consensus forecasts—the latest consensus fore- 
cast six months before the earnings announcement. These “old” forecasts are less subject 
to management guidance, which generally occurs in a short period before earnings an- 
nouncements. If expectation management is driving the results, then we should not observe 
similar patterns when we use “old” consensus forecasts. The results, not reported for the 
sake of brevity, are similar to those reported, suggesting that our results are mainly due to 
earnings management." 

To summarize, the results presented in this section are consistent with H1. CEOs with 
high equity incentives are more likely to meet or just beat analysts' forecasts and less likely 
to miss analysts' forecasts, compared to those with low equity incentives. 


Earnings Management and Future Trading Conditional on Equity Incentives 

In this section, we investigate the impact of meeting or beating analysts' forecasts on 
net sales separately for managers with high equity incentives and for those with low equity 
incentives. Observing more selling for managers with high equity incentives, who are more 
likely to manage earnings based on results reported above, suggests that managers increase 
sales after earnings management. 

To capture the differential selling, we first classify the sample into four groups based 
on aggregate equity incentives and earnings surprises: 


H_MB = firms with high equity incentives and meeting or beating analysts’ forecasts; 
H_Miss = firms with high equity incentives and missing analysts’ forecasts; 

L_MB = firms with low equity incentives and meeting or beating analysts’ forecasts; and 
L_Miss = firms with low equity incentives and missing analysts’ forecasts. 


We measure the aggregate equity incentives as the sum of unexercisable options, weighted 
by their hedge ratios as measured in Core and Guay (1999), and ownership; high (low) 
equity incentive managers are those with aggregate equity incentives higher (lower) than 
the sample median. 

We then include dummy variables that indicate each of the four groups into Equation 
(i: = 


17 The overall weak results on expectation management are consistent with its ambiguous net effect on stock prices. 
Managers can benefit from the guidance if the stock price is higher after (1) managing analysts' forecasts 
downward (and presumably resulting in a lower price) and (2) meeting or beating analysts' forecasts later. If 
managers are uncertain whether the upward impact of meeting or beating analysts' forecasts outweighs the 
downward impact of guidance, then they are likely not to use this tool. 
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NetSale;,,, = B,H MB,, + BH Miss,, + BJL MB,, + B,L_Miss,, 
+ y,Size,, + y,Growth,, + ysRet,, + y4AOCF,, E (3) 


The coefficient on each dummy variable captures how much each group of firms sells in 
the future. The difference in coefficients, B, — B, (B, — Ba), indicates whether high (low) 
equity incentive managers sell more after meeting or beating analysts' forecasts than after 
missing analysts’ forecasts. 

Table 4 reports the regression results. Coefficients on all dummy variables are signifi- 
cantly positive, suggesting that firms in each group are net sellers. As reported at the bottom 
of the table, the coefficient on H_MB is significantly higher than that on R_. Miss, at the 
0.05 level in the first six-month period and at the 0.08 level in the second six-month period. 
Thus, for high equity incentive managers, meeting or beating analysts' forecasts leads to 
more net sales in the first year after earnings announcements than missing analysts' fore- 
casts. However, meeting or beating analysts’ forecasts does not appear to motivate low 
equity incentive managers to sell more: there is no significant difference between the co- 
efficient on L..MB and that on L..Miss.!? 

Overall, these results are consistent with high equity incentive managers increasing 
their sales after earnings management. Doing so is beneficial if such earnings management 
can increase stock prices in the near term. In this sense, subsequent trading serves as the 
“exit strategy" for earnings management, thereby increasing the credibility of our earnings 
management evidence (Schipper 1989). 


V. ADDITIONAL ANALYSES 
Equity Incentives and Earnings Smoothing 

The evidence that managers with high equity incentives are more likely to report earn- 
ings that meet or just beat analysts' forecasts is consistent with their incentives to manage 
earnings upward to avoid negative earnings surprises, as suggested by Stein (1989). At the 
same time, managers might have incentives to manage earnings downward in years with 
good performance to reserve for the future. Earnings smoothing is appealing in this context 
for at least two important reasons. First, it is difficult, if not impossible, for a firm to manage 
earnings upward consistently. By engaging in income-decreasing earnings management in 
years with good performance, managers can increase earnings as needed in the future. Such 
smoothing reduces the likelihood of missing analysts' forecasts and increases the likelihood 
of meeting analysts' forecasts in future periods. 

Second, equity incentives can be recurring—managers with high equity incentives are 
more likely to have high equity incentives in the future. Unreported results indicate that 
equity incentives are positively serially correlated. Relatedly, as reported in Panel A of 
Table 5, equity incentives are positively correlated with managers' sales in the second year 
after earnings announcements. (As before, exercisatle options are not correlated with future 
trading.) Thus, managers with high equity incentives also care about stock prices in the 
future; if they cannot meet analysts' forecasts in the future, then they bear negative 
consequences. 

These discussions suggest that high equity incentive managers are likely to smooth 
earnings. One way to smooth earnings is to avoid large positive earnings surprises. By 
doing so, managers can reserve for the future and reduce the likelihood of missing analysts’ 
forecasts in the future. Thus, we expect a negative association between the incidence of 


13 A formal F-test indicates that (B,— BY) — (B4— B.) is significantly different from zero. 
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TABLE 4 
The Impact of Meeting or Beating Analysts’ Forecasts on CEOs’ Net Sales 
Conditional on Equity Incentives 


This table reports results from regressing future net sales (96) on four dummy variables, HMB, 
H .Miss, L . MB, and L. Miss, and control variables: 


NetSale,,,, = B,H MB,, + B,H_Miss,, + BL MB,, + BL Miss,, + y,Size,, + y, Growth,, 
+ y3Ret,, + y, AOCF,, + Uu (3) 


where j — 1 or 2, denotes the jth six-month period after the earnings announcement, and the four 
dummy variables are defined as follows: 


H_MB = 1 if the firm has high equity incentives and meets or beats analysts’ forecasts; 
H..Miss = 1 if the firm has high equity incentives and misses analysts’ forecasts; 

L..MB = 1 if the firm has low equity incentives and meets or beats analysts’ forecasts; and 
L..Miss — 1 if the firm has low equity incentives and misses analysts' forecasts. 


High or low equity incentives are classified based on the sample median of an aggregate equity 
incentive measure, which is the sum of unexercisable options, weighted by their hedge ratios as 
measured in Core and Guay (1999), and ownership. See Appendix A for the measurement of other 
variables. Year dummies are included to control for year fixed effects. The regressions are estimated 
on all firm-years with data available in the period 1993—2000. The p-values in parentheses are 
based on one-sided t-tests. The table also reports the difference in coefficients between H_MB and 
H_ Miss, that between L_MB and L. Miss, and the p-value of the corresponding F-tests. 


Predicted 
Signs 1st Six-Month 2nd Six-Month 
HMB (Bj) + 0.620 0.567 
(0.001) (0.001) 
H_ Miss (B) + 0.534 0.500 
(0.001) (0.001) 
L..MB (B4) t 0.469 0.414 
(0.001) (0.001) 
L..Miss (B,) + 0.419 0.410 
(0.001) (0.001) 
Control variables Yes Yes 
n 1,637 1,389 
Adj. R? 0.168 0.158 
The impact of meeting or beating analysts' forecasts on net sales (p-value) 
For high equity incentive t 0.086 0.067 
managers (B, — B) (0.054) (0.077) 
For low equity incentive + 0.050 0.004 
managers (B4 — B,) (0.186) (0.908) 


large positive earnings surprises and equity incentives, and this negative association should 
be stronger for firms with persistent equity incentives, which arguably have stronger incen- 
tives to smooth earnings. We test these two predictions below. 


Equity Incentives and Avoiding Large Positive Earnings Surprises 
Column (1) of Panel B, Table 5 reports the logit regression results of the probability 
of large positive earnings surprises (i.e., earnings surprises equal to or greater than four 
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cents) on equity incentives and control variables as in Equation (2). The results are con- 
sistent with earnings smoothing. À one standard deviation increase in unexercisable options 
reduces by 14.3 percent the odds of having large positive earnings surprises, while a one 
standard deviation increase in ownership reduces by 11.6 percent the odds of having large 
positive earnings surprises. The results are similar if we use different cutoff points (such 
as three cents, five cents, or six cents) to measure large positive earnings surprises. Con- 
sistent with the future trading evidence, exercisable options are not negatively correlated 
with the probability of large positive earnings surprises. 

If reserving for the future is successful, equity incentives should be positively correlated 
with the probability of meeting or just beating analysts' forecasts in the next year. Column 
(2) of Panel B reports the logit regression of meeting or just beating analysts' forecasts in 
year t+] on equity incentives in year t. While the results for option holdings are insignif- 
icant, firms with high ownership are more likely to meet or just beat analysts' forecasts in 
the next year, consistent with earnings smoothing. 


TABLE 5 
Equity Incentives and Earnings Smoothing 


Panel A: Equity Incentives and CEOs' Trading in the Second Year after Earnings 
Announcements 


This panel reports the results from regressing CEOs’ net sales (%) in the second year (the 3rd and 
4th six-month periods) after earnings announcements on equity incentive measures (96) and control 
variables: 


NetSale,,.; = Yo + B Eq. Incent,, + y,Size,, + y,Growth,, + ysRet,, + y4AOCF,, + Gay (D 


where j = 3 or 4, denotes the jth six-month period after the earnings announcement. See Appendix 
A for variable measurement. The regressions are estimated on all firm-years with data available in 
the period 1993-2000. Year dummies are included to control for year fixed effects. The p-values in 
parentheses are based on one-sided t-tests. 


Predicted 
Signs 3rd Six-Month 4th Six-Month 
Equity incentives 
Unexercisable options T 0.076 0.068 
(0.002) (0.011) 
Ownership + 0.021 0.019 
(0.001) (0.001) 
Exercisable options + 0.030 —0.002 
(0.144) (0.538) 
Control variables 
Size = —0.037 —0.048 
(0.001) (0.001) 
Growth = —0.233 —0.256 
(0.001) (0.001) 
Stock returns T 0.052 0.079 
(0.026) (0.007) 
Change in cash flow t 0.128 —0.191 
(0.790) (0.867) 
n 2,213 1,624 
Adj. R? 0.085 0.086 


(continued on next page) 
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TABLE 5 (continued) 


Panel B: Equity Incentives and Earnings Smoothing 


This panel reports the change in the odds of firms (1) beating analysts' forecasts by at least four 
cents in the current year, and (2) meeting or just beating analysts' forecasts by one cent in the next 
year, due to a change of one standard deviation of a continuous variable, and the p-value (in 
parentheses) of the corresponding coefficient in the logit regressions. See Table 3 for model 
specifications and Appendix A for variable measurement. Year dummies are included to control for 
year fixed effects. Column (1) is based on current year's earnings surprises ee and Column 
(2) is based on the next year's earnings surprises (Surprise, ,,,). Surprise,,,, is the difference 
between First Call actual EPS for year t-- 1 and the latest consensus forecast for year t-- 1 disclosed 
before the earnings announcement for year t+ 1, rounded to the nearest cent. 


The Probability of Large 
Positive Earnings The Probability of Meeting or 
Surprises Just Beating Analysts’ 
in the Current Year Forecasts in the Next Year 
Prob(Surprise,, = $0.04) Prob(Surprise,,,, = $0 or $0.01) 
(1) (2) 
Change in Change in 
Predicted the Odds Predicted the Odds 
Signs (p-value) Signs (p-value) 
Equity incentives 
Unexercisable options = —0.143 + 0.041 
(0.001) (0.170) 
Ownership = —0.116 t 0.167 
(0.007) (0.001) 
Exercisable options = 0.152 + —0.064 
(0.998) (0.942) 
Control variables Yes Yes 
n 4,301 3,429 
Pseudo R? 0.150 f 0.135 
Likelihood ratio 397 291 
(p-value) | (0.001) (0.001) 


Panel C: The Incremental Impact of Persistent Equity Incentives on Earnings Smoothing 


This panel reports the incremental impact of persistent equity incentives on earnings surprises: the 
probability of large positive earnings surprises in Column (1) and the probability of meeting or just 
beating analysts' forecasts in Column (2), using tbe following regression: 


Prob(Surprise,, € ®,) = logit(yo, + B, ,Total,, + B, Total,, X Recurring, 
+ B,,Exercisable options,, + "y, Controls,, + b) 


where Controls are those included in Equation (2). The table reports the change in the odds due to 
a change of one standard deviation of a variable and the p-value (in parentheses) of the 
corresponding coefficient in the logit regressions. The aggregate equity incentive measure, Total, is 
the sum of unexercisable options, weighted by the hedge ratio as measured in Core and Guay 
(1999), and ownership. To identify firms with persistent equity incentives, we first rank all firm- 
years into three equal-sized groups: high, medium, or low, based on the aggregate equity incentive 
measure. We then calculate the average ranking for each firm over the sample period. If a firm's 
average ranking is above the 3rd quartile or below the 1st quartile of the average ranking 
distribution, this firm is regarded as having persistent equity incentives (consistently high or 
consistently low equity incentives, respectively); for these firms, the dummy variable, Recurring, is 
defined as 1, and 0 otherwise. See Appendix À for the measurement of other variables. Year 
dummies are included to control for year fixed effects. The logit regressions are based on 3,958 
firm-years with all required data in the period 1993-2002. 
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Large Positive Meeting or Just Beating 
Surprises Analysts’ Forecasts 
b = [$0.04, o) Ó = [$0, $0.01] 
DIEM a a a TREE 
Predicted Change in the Odds Predicted Change in the Odds 
_ Signs — (p-value) — Signs (p-value) &— 
Total ? 0.024 4 0.262 
(0.760) (0.001) 
Total X Recurring ú —0.218 ? 0.038 
(0.001) (0.509) 
Exercisable options ? 0.063 ? 0.002 
(0.233) (0.961) 
Control variables Yes Yes 
Pseudo R2 0.164 0.198 
Likelihood ratio 368 577 
(p-value) (0.001) (0.001) 


Earnings Smoothing for Firms with Persistent Equity Incentives 

To test whether firms with persistent equity incentives are more likely to smooth earn- 
ings, we first rank all firm-years into three equal-sized groups: high, medium, or low, based 
on the aggregate equity incentive measure—the sum of unexercisable options, weighted by 
the bedge ratio as measured in Core and Guay (1999), and ownership. We then calculate 
the average ranking for each firm over the sample period. To increase the reliability of the 
classification, we only consider the 1,400 firms that have at least five years of data from 
ExecuComp over the eight-year period 1993—2000. 

Jf a firm's average ranking is above the third quartile or below the first quartile of the 
average ranking distribution, then this firm is regarded as having persistent equity incentives 
(consistently high or consistently low equity incentives, respectively). Under this approach, 
56 percent of the firm-years exhibit persistent equity incentives. We use a dummy variable, 
Recurring, to denote these firm-years. We then add an interaction of this dummy with 
equity incentives to the logit regression (Equation (2)) to capture the incremental effect for 
firms with persistent equity incentives. To simplify the analyses, we use the aggregate equity 
incentive measure (Total). 

The results are reported in Column (1) of Panel C, Table 5. As expected, equity incen- 
tives are negatively correlated with the probability of large positive earnings surprises only 
for firms with persistent equity incentives. That is, firms with persistent equity incentive 
measures are more likely to reserve for the future and those with less persistent equity 
incentives do not have such incentives. 

In addition, we test whether managers with persistent equity incentives drive the results 
on meeting or just beating analysts' forecasts reported in Table 3. In Column (2) of Panel 
C, we add an interaction between the recurring dummy and equity incentives in the logit 
regression of meeting or just beating analysts' forecasts in the current period. The coefficient 
on the interaction term is insignificant, suggesting that equity incentives affect the likelihood 
of meeting or just beating analysts' forecasts simularly for firms with persistent or non- 
persistent equity incentives. 


'9 The results are similar if we exclude firm-years with consistently low equity incentives. 
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Equity Incentives and Earnings Management via Abnormal Accruals 


In this section, we investigate the relation between equity incentives and abnormal 
accruals, a commonly used proxy for earnings management. In this paper's context, man- 
agers can benefit from income-increasing abnormal accruals. At the same time, the earnings 
smoothing incentives documented above imply income-decreasing abnormal accruals in 
years with good performance. Thus, ex ante, the direction of the correlation between equity 
incentives and abnormal accruals is unclear. If the incentive for increasing income domi- 
nates that for reserving for the future, the correlation is positive; otherwise, it is negative. 

Abnormal accruals are defined as the difference between total accruals and normal 
accruals. Normal accruals are estimated using the cross-sectional Jones (1991) model as 
described in DeFond and Jiambalvo (1994). Panel A of Table 6 describes the estimation 
method and reports the descriptive statistics on abnormal accruals. The mean and median 
abnormal accruals (scaled by beginning-of-period total assets) are 0.002 and 0.001, respec- 
tively. Untabulated statistics suggest that these numbers are insignificantly different from 
zero at the 0.05 level. 

Panel A of Table 6 also compares abnormal accruals and other firm characteristics 
between our sample and all Compustat firms with abnormal accruals. While there is no 
significant difference in abnormal accruals, the sample firms are larger and have better 
performance than the Compustat population. The average total assets are around $2 billion 
for the sample firms, but only $886 million for the Compustat population. Earnings and 
operating cash flow on average are 5 percent and 11.5 percent of beginning-of-period assets 
for the sample firms, but —10.8 percent and —2.6 percent for the Compustat population, 
respectively. These differences are due to data restrictions on equity incentive measures and 
we control for both size and firm performance in the analyses.” 


Equity Incentives and Abnormal Accruals 
We use the following regression to investigate the impact of equity incentives (Eq.- 
Incent) on abnormal accruals: 


Abnormal Accruals,, = yo + B Eg Incent;, + y,Size,, + y2Leverage,, 
+ y,Bonus,, + *y4Risk,, + ysGrowth,, + 44NOA,, , 
+ y,Litigation,, + y,lmplicitClaim,, + 6, (4) 


See Appendix A for variable measurement. In the regression, we control for factors that 
might affect abnormal accruals based on prior research. Positive accounting theory suggests 
that managers tend to manage earnings to decrease political costs (proxied by size), to relax 
debt covenants (the closeness to debt covenant violation is proxied by leverage), and to 
increase bonuses (Healy 1985; Watts and Zimmerman 1986; Sweeney 1994; among oth- 
ers).?! In addition, firms with high equity incentives are generally faster-growing and riskier 
(Lambert and Larcker 1987; Murphy 1999), and Warfield et al. (1995) indicate that riskier 


^ ExecuComp covers companies within the S&P 1500 (the combination of the S&P 500 index, the S&P MidCap 
400 index, and the S&P SmallCap 600 index). In addition to size requirements, companies must have positive 
earnings (excluding discontinued operations and extraordinary items) to be included in these indexes. 

21 To reduce the burden of data collection, details of bonus plans are not exploited in this study. Instead, we use 
a dummy variable to indicate firm-years with nonzero bonus. We find similar results when using alternative 
measures of bonus, including a dummy variable based on the ranking of return on assets as used in McNichols 
and Wilson (1988) and Bowen et al. (1995) and the ratio of cash bonus over total compensation. We also 
estimate the regressions after excluding the bonus variable and find similar results on equity incentive variables. 
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and high-growth firms have more abnormal accruals (which could be of either sign). Thus, 
we also control for the impact of systematic risk and growth opportunities, and several 
other variables that might affect earnings management, as discussed in Section IV. 

The regression results are reported in Panel B of Table 6. Both unexercisable options 
and ownership are positively correlated with abnormal accruals, and exercisable options are 
not. These results suggest that on average, high equity incentive CEOs increase earnings 


TABLE 6 
Equity Incentives and Earnings Management via Abnormal Accruals 


Panel A: Descriptive Statistics on Abnormal Accruals and Other Firm Characteristics 








The Sample Compustat Population 
(n = 6,867) (n = 50,984) 
Inter-Quartile Inter-Quartile 
Mean Median Range Mean Median Range 
Abnormal accruals 0.002 0.001 0.086 0.007 0.006 0.141 
Total assets ($million) 2,028 624 1.537 886 66 310 
Earnings 0.050 0.068 0.087 —0.108 4 0.026 0.196 
Operating cash flows 0.115 0.113 0.111 —0.026 0.056 0.191 


All variables, except total assets, are deflated by beginning-of-period total assets. The descriptive 
statistics for the sample are based on all firm-years with both equity incentives and abnormal 
accruals in the period 1993—2000. The Compustat population includes all firm-years in the same 
period with abnormal accruals. Abnormal accruals are defined as the difference between total 
accruals and normal accruals. Normal accruals are estimated using the cross-sectional Jones (1991) 
model as described in DeFond and Jiambalvo (1994). In this model, normal accruals are estimated 
as a function of change in revenues and level of property, plant, and equipment: 


TA, A, 1 Tm bo(1/A,,. 1) F b (AREV, ,/A;,_;) + b.(PPE, ,/A,,,-1) + Hu 
where: 


TA, , = total accruals, i.e., income before extraordinary items (#18) minus operating cash flows 
(#308); 
À,,., = beginning-of-period total assets (#6); 
AREV,, = change in net revenues (#12) from year t—1 to year z, included to control for normal 
changes in working capital; 
PPE,, = gross property, plant, and equipment (#7), included to control for normal depreciation 
expense; 
p, = error term; 
i,t = subscript for firm i in year t. 


The portion of total accruals that is not explained by normal operating activities, i.e., the residual 
Hzn is the estimate of abnormal accruals. Because operating characteristics might vary across 
industries and years, the estimation is conducted for each industry-year combination, based on all 
firm-years with data available from Compustat. Industry is defined on the basis of two-digit SICs 
and industry-year combinations with less than seven observations are deleted. 


(continued on next page) 
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TABLE 6 (continued) 
Panel B: Regressions of Abnormal Accruals on Equity Incentive Measures and Control 
Variables 
This panel reports the coefficients and the accompanying p-value of the following regression: 


Abnormal Accruals,, = Yo + B Eq_Incent,, + y,Size,, + y,Leverage,, + y,Bonus,, 
+ oyj4Risk,, + y;Growth,, + 44NOÀ,,., + yLitigation,, 
t y,lmplicit Claim,, + C, (4) 
This regression is estimated using the pooled sample with year dummies to control for year fixed 


effects. See Appendix A for variable measurement. The p-values in parentheses are based on two- 
sided t-tests. 


Abnormal 
Accruals 
Equity incentives 
Unexercisable options 0.695 
(0.012) 
Ownership 0.070 
(0.001) 
Exercisable options 0.140 
(0.407) 
Control variables 
Size 0.001 
(0.529) 
Leverage —0.032 
(0.003) 
Bonus 0.007 
(0.059) 
Risk —0.001 
(0.841) 
Growth ' —0.005 
(0.342) 
Net operating assets —0.014 
(0.001) 
Litigation —0.013 
(0.001) 
Implicit claim —0.019 
(0.601) 
n 6,337 
Adj. R? 0.022 


(continued on next page) 
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TABLE 6 (continued) 


Panel C: Regressions of CEOs’ Net Sales on Abnormal Accruals 


This panel reports the results from regressing future net sales (56) on equity incentives (9), 
abnormal accruals, and control variables: 


NetSale;,,, = Yo + B Eq Incent,, + Š AbnormaiAccruals,, + y,Size,, 
+ y,Growth,, + y3Ret,, + y,AOCF,, + E f 


where j = 1 or 2, denotes the jth six-month period after the earnings announcement. Positive and 
negative abnormal accruals are included separately to capture their potentially differential effects. 
Positive abnormal accruals are zero when abnormal accruals are negative, and negative abnormal 
accruals are zero when abnormal accruals are positive. To reduce the impact of measurement errors 
in abnormal accruals, decile ranks (scaled to the range between 0 and 1) are used. See Appendix A 
for variable measurement. The regressions are estimated on all firm-years with data available in the 
period 1993—2000. Year dummies are included to control for year fixed effects. The p-values in 
parentheses are based on one-sided t-tests for coefficients with signed predictions and on two-sided 
t-tests otherwise. 


Predicted 
Signs 1st Six-Month 2nd Six-Month 
Equity incentives 
Unexercisable options T 0.086 0.064 
(0.001) (0.002) 
Ownership t 0.016 0.014 
(0.001) (0.001) 
Exercisable options s —0.008 0.018 
(0.749) (0.117) 
Abnormal accruals 
Positive abnormal accruals + 0.075 0.074 
(0.019) (0.054) 
Negative abnormal accruals ? —0.050 —0.073 
(0.171) (0.093) 
Control variables Yes Yes 
n 2,723 2,316 
Adj. R? 0.109 0.068 


via abnormal accruals.” With respect to control variables, firms with high leverage, without 
a bonus plan, with high beginning-of-period net operating assets, in high litigation indus- 
tries, and with high implicit claims take less positive abnormal accruals. 

We conduct additional analyses to assess the robustness of our results to possible mea- 
surement errors in abnormal accruals (Dechow et al. 1995; Bernard and Skinner 1996). 
First, because firms with extreme performance are more likely to have measurement errors, 
we add a control for firm performance (return on assets, i.e., ROA) or exclude firms with 
extreme earnings (top 5 percent, 10 percent, or 20 percent based on |ROA| ranking). Second, 
we include risk, earnings persistence, and earnings variability as explanatory variables to 
better control for the volatility of the firm's economic environment or operations. Third, 
we replace abnormal accruals with inventory changes, which are shown to be used for 


2 Warfield et al. (1995) find a negative association between managerial ownership and the absolute value of 
abnormal accruals. Unlike Warfield et al. (1995), we focus on the relation between all equity incentive elements 
and signed abnormal accruals. Thus, one should be careful in.comparing our results with those of Warfield et 
al. (1995). 
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earnings management purposes and are less subject to measurement errors than abnormal 
accruals (Thomas and Zhang 2002). Our results are robust to these alternative specifications. 

Overall, these results suggest that CEOs with high equity incentives take more income- 
increasing abnormal accruals than those with low equity incentives. That is, the incentive 
to increase earnings dominates that to reserve for the future in this particular sample. While 
this finding appears at odds with the earnings smoothing argument discussed above, addi- 
tional analyses indicate that the results are largely driven by managers with less persistent 
equity incentives, who are less concerned with accrual reversal and have fewer incentives 
to reserve for the future. When we split the sample based on the average ranking of equity 
incentives over the sample period, as discussed above, we find that equity incentives are 
not significantly correlated with abnormal accruals for firms with persistent equity incen- 
tives, but the correlation is significantly positive for firms with less persistent equity 
incentives. 


Abnormal Accruals and Future CEO Trading 

Similar to analyzing the relation between meeting or beating analysts' forecasts and 
future CEO trading, we also investigate the relation between abnormal accruals and future 
trading. Panel C of Table 6 reports regression results when abnormal accruals are added to 
Equation (1). We add positive and negative abnormal accruals separately in the regression 
to detect any differential impact. The coefficients on positive abnormal accruals are signif- 
icantly positive, and the coefficients on negative abnormal accruals are insignificant (mar- 
ginally significantly negative in the second six-month period). These results are consistent 
with CEOs increasing sales after income-increasing abnormal accruals.? 


Other Sensitivity Tests 
Alternative Measures of Equity Incentives 

We examine the robustness of our results to the measurement of equity incentives in 
several ways. First, in the main analysis, we implicitly assume that the hedge ratio of options 
is constant across firms. To relax this assumption, we explicitly measure the hedge ratio 
for individual firms, as done in Core and Guay (1999), and adjust the measurement of 
exercisable and unexercisable options accordingly (i.e., multiplying the option measures by 
the associated hedge ratio). Second, as argued in Baker and Hall (2004), if earnings man- 
agement affects firm value on a proportionate basis rather than on a dollar-for-dollar basis, 
a more appropriate measure is the dollar value of the incentive variable. Thus, we also use 
the magnitude of equity incentives, measured as the natural logarithm of one plus the change 
in the value of unexercisable options and ownership due to 1 percent increase in stock 
prices. The results based on these alternative equity incentive measures are similar to those 
reported. 

Last, we measure stock-based compensation as the ratio of stock grants and option 
grants (both in dollars) to the total compensation (the sum of salary, bonus, stock grants, 
option grants, long-term incentive payouts, and other compensation). This measure captures 
the proportion of CEOs' compensation that is sensitive to stock prices. The results on this 
alternative measure, as reported in Panel A of Table 7, are consistent with earnings man- 
. agement behavior: stock-based compensation is positively correlated with the probability 
of meeting or just beating analysts' forecasts. The results on other variables, including 
ownership, are similar to those from the main analyses. 


2 Further analyses (not reported for the sake of brevity) indicate that managers taking more negative abnormal 
accruals sell more in the second year after earnings announcements, presumably after the reversal of abnormal 
accruals. This result is consistent with managers postponing sales if they take negative abnormal accruals. 
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Analyses of Future CEO Trading Conditional on Future Realized Returns 

As discussed in Section II, one important reason for managers with high equity incen- 
tives to sell shares in the future is that their exposure to the idiosyncratic risk increases 
with stock prices, because the sensitivity of options to stock price increases, or because 
their wealth becomes more concentrated in their companies’ stocks, or both. If this is true, 
then it follows that managers with the same equity incentives will sell more when stock 
prices increase more. To explicitly test this prediction, we add interactions between equity 
incentive measures and future realized returns into the future trading regressions. We use 
a dummy variable to capture large increases in stock prices; the dummy variable is 1 if the 
realized return in the period when net sales are measured is in the top quartile. 

We report the results in Panel B of Table 7. Consistent with our prediction, the inter- 
actions of unexercisable options and ownership with the high return dummy have signifi- 
cantly positive coefficients, suggesting that managers sell more in the future due to equity 
incentives when stock prices increase dramatically than otherwise. These results lend further 
support to our argument on future trading motivated by equity incentives. 


TABLE 7 
Additional Analyses 


Panel A: Results of Meeting or Just Beating Analysts’ Forecasts Based on the Ratio of Stock 
and Option Grants to Total Compensation 


This panel reports the results from regressing the probability of earnings surprises in a certain range 
on equity incentives and controls: 


Prob(Surprise,, € ®,) = logit(yo, + B,Eq Incent,, + y,Controls,, + š, ), 


where equity incentives include a stock and option grant ratio and ownership, and controls are 
those included in Equation (2). The table reports the change in the odds due to a change of one 
standard deviation of a variable and the p-value (in parentheses) of the corresponding coefficient in 
the logit regressions. The stock and option grant ratio is defined as the ratio of stock grants and 
option grants (both in dollars) to the total compensation, which includes salary, bonus, stock grants, 
option grants, long-term incentive payouts, and other compensation. See Appendix A for the 
measurement of other variables. Year dummies are included to control for year fixed effects. The 
regressions are based on 4,274 firm-years with all required data in the period 1993—2000. 


Meeting or Just Beating 
Negative Earnings Surprises Analysts? Forecasts 
Ó = TA 0) b = ps $0.01] 


Predicted Change in the Odds Predicted Change in the Odds 
Signs (p-value) Signs (p-value) 


Equity incentives 


Stock and option grant ratio — —0.103 + 0.105 
(0.004) (0.002) 
Ownership = —0.203 t 0.297 
(0.001) (0.001) 
Control variables Yes Yes 
Pseudo R? 0.195 0.180 
Likelihood ratio 603 612 
(p-value) (0.001) (0.001) 


(continued on next page) 
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TABLE 7 (continued) 


Panel B: Analyses of Future Net Sales Conditional on Future Realized Stock Returns 


This panel reports results from regressing future net sales (96) on equity incentives (96), a high 
return dummy variable (High ret), the interactions between equity incentives and the high return 
dummy, and control variables: 


NetSale a; = Yo + B Eq Incent,, + Š Eq Incent,, X High ret;,,, + x High_ret,,,, 
+ ySize,, + y,Growth,, + y4Ret,, + 4,AOCF,, + bip 


where j = 1 or 2, denotes the jth six-month period after the earnings announcement. High. ret, ,,, is 
1 when the return in the period t+j is in the top quartile. See Appendix A for the measurement of 
other variables. The regressions are estimated on all firm-years with data available in the period 
1993—2000. Year dummies are included to control for year fixed effects. The p-values in 
parentheses are based on one-sided t-tests. 


Predicted 
Signs 1st Six-Month 2nd Six-Month 
Equity incentives 
Unexercisable options t 0.056 0.017 
(0.003) (0.052) 
Ownership + 0.012 0.008 
(0.001) (0.001) 
Exercisable options + 0.001 0.006 
(0.476) (0.374) 
Equity incentives x High_ret 
Unexercisable options X High_ret + 0.080 0.230 
(0.017) (0.001) 
Ownership X High_ret F 0.016 0.023 
(0.001) (0.001) 
Exercisable options x High_ret + — 0.025 0.046 
(0.826) (0.072) 
High ret + 0.032 —0.031 
(0.337) (0.565) 
Control variables Yes Yes 
n 2,723 2,316 
Adj. R? 0.129 0.133 


Results for Subperiods 

To ensure that boom markets in late 1990s do not drive our results, we split the sample 
into two subsamples: 1993—1996 and 1997—2000. The results in these two subsamples are 
similar except that the impact of unexercisable options on net sales and earnings manage- 
ment, while significant in both subsamples, is slightly stronger in later years than in earlier 
years. 


Alternative Explanations 


One might argue that our results on meeting or beating analysts' forecasts are consistent 
with better firm performance attendant to the incentive alignment benefit of equity incen- 
tives. This alternative explanation is unlikely to drive our results for at least two reasons. 
First, financial analysts are generally regarded, and empirically confirmed, as sophisticated 
information intermediaries; thus, analysts arguably have incorporated into their forecasts 
the performance improvement due to equity incentives. If so, then forecast errors should 
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not be correlated with equity incentives. Second, better firm performance arising from 
incentive alignment benefits cannot explain the evidence that firms with high equity incen- 
tives are less likely to have large positive earnings surprises. Being less likely to have large 
positive earnings surprises is not a signal of good performance, but is consistent with our 
earnings smoothing argument. 

Another alternative explanation relates to the possibility that our results are driven by 
equity incentives proxying for firm characteristics, such as growth, risk, or earnings vola- 
tility. In the main and sensitivity analyses, we explicitly control for these factors. Also, 
untabulated results indicate that firms with volatile earnings (high CV. AF) are less likely 
to meet or just beat analysts' forecasts and more likely to have large positive earnings 
surprises, but the results on equity incentives are just the opposite. Furthermore, while both 
exercisable and unexercisable options are similarly related to firm characteristics (as shown 
in Appendix B), exercisable options, unlike unexercisable options as explained above, are 
not associated with future trading or earnings management measures. Such evidence sug- 
gests that omitted correlated variables are unlikely to drive our results. 

Finally, to control for industry characteristics (e.g., performance volatility and earnings 
informativeness) that might affect the level of equity incentives and the probability of 
meeting or just beating analysts' forecasts, we replicate all analyses using an industry- 
demeaned equity incentive measure. To this end, we combine unexercisable options, 
weighted by their hedge ratios, with stock ownership and then deduct from this measure 
its mean in the industry-year level (industry is defined based on three-digit SICs.) The 
inferences based on this measure are essentially the same. While we cannot completely rule 
out the possibility that these alternative explanations contribute to the reported results, we 
believe that our results are more consistent with earnings management induced by equity 
incentives rather than with these alternative explanations. 


VI. SUMMARY AND CONCLUSIONS 

In this paper, we examine the relation of equity incentives—broadly defined as stock- 
based compensation and stock ownership—with managers’ future trading and earnings 
management. We first study the link between equity incentives and managers' future trading. 
Based on all firm-years with available data over the 1993—2000 period, we find that high 
equity incentive managers are more likely to sell shares in the year after earnings an- 
nouncements. This selling behavior is consistent with managers' diversifying the increased 
risk arising from equity incentives, as argued in the paper, and/or with managers exploiting 
inside information on the pricing of their companies' stocks. As stated by Michael Jensen, 
"Once a firm's shares become overvalued, it is in managers' interests to keep them that 
way, or to encourage even more overvaluation, in the hope of cashing out before the bubble 
bursts" (The Economist 2002). Thus, managers with high equity incentives could benefit 
from earnings management with the objective of keeping stock prices high and increasing 
the value of the shares to be sold in the future. 

We then test whether high equity incentive managers engage in earnings management 
by examining whether these managers are more likely to report earnings that meet or just 
beat analysts' forecasts. The analyses indicate a significantly higher incidence of meeting 
or just beating analysts’ forecasts for managers with high equity incentives. Additional 
analyses document that managers with high equity incentives, especially those with con- 
sistently high equity incentives, are less likely to report large positive earnings surprises, 
consistent with earnings smoothing. 

Overall, our results suggest that stock-based compensation and ownership can lead to 
incentives for earnings management. If earnings management can increase short-term stock 
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prices, managers can benefit from doing so by increasing the value of the shares they are 
going to sell. Results from additional analyses indicate that high equity incentive managers 
can even gain more by selling more shares after reporting earnings that meet or beat 
analysts’ forecasts or after recognizing income-increasing abnormal accruals. Consequently, 
such earnings management behavior likely increases managers' wealth at the expense of 
outside shareholders. Thus, this paper extends research on compensation-related opportun- 
istic behavior, such as Aboody and Kasznik (2000). 

Our findings should be of interest to boards of directors contemplating compensation 
contracts for managers. While stock-based compensation and ownership can yield positive 
incentive effects, they can also have side effects, such as increased earnings management. 
An optimal compensation contract should reflect this trade-off. Our results also have im- 
plications for financial reporting regulators, who have criticized managers for focusing too 
much on short-term performance targets, potentially to the detriment of earnings quality 
(Levitt 1998). Given the increased importance of stock-based compensation as a form of 
executive compensation and managerial ownership as an incentive alignment mechanism, 
our analyses indicate that regulatory responses to address opportunistic earnings manage- 
ment should consider incentives arising from stock-based compensation and ownership. 


APPENDIX A 
Variable Measurement 


Variables Measurement 
Panel A: Equity Incentives 


Eq. Incent,, = equity incentives are measured in shares during or at the end of fiscal year 
t, scaled by the number of outstanding shares; this measure includes five 
components: option grants, unexercisable options (excluding option grants), 
exercisable options, restricted stock grants, and ownership. Unless noted, 
restricted stock grants are combined with ownership and option grants are 
combined with other unexercisable options. 


Panel B: Future Net Sales 


NetSale,,,, = CEO's net sales (in dollars) in the jth six-month period after the earnings 
announcement for fiscal year t, scaled by the market value at the end of 
fiscal year t, where j — 1 and 2, and net sales — open market sales — 
(open market purchases + options exercised). 


Panel C: Earnings Surprises 
Surprise,, — difference between actual earnings and the latest consensus forecast before 
earnings announcements, both of which are measured on a per-share basis 
and are collected from First Call; it is rounded to the nearest cent; and 


D. MB,, = meeting or just beating dummy variable, 1 if earnings surprises are zero or 
one cent, and 0 otherwise. 
Panel D: Abnormal Accruals 
Abnormal Accruals,, = abnormal accruals estimated from the modified cross-sectional Jones (1991) 
model (see Panel A of Table 6 for the estimation details). 
Panel E: Factors Affecting Future Net Sales 


Size,, = natural logarithm of market value (in million dollars) at the end of fiscal 
ear f, 
Growth,, = the book-to-market ratio at the end of fiscal year t; 
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Ret,, — buy and hold raw return in the 12 months prior to the earnings 
announcement for fiscal year t; and 
AOCF,, = change in operating cash flow in year t, scaled by total assets at the 


beginning of year t. 


Panel EF: Factors Affecting Earnings Surprises 


Size,, and Growth,, , as measured in Panel E. 
Sales. .Growth,, = sales for fiscal year t divided by sales for fiscal year t— 1; 
NOA,, = net operating assets (i.e., shareholders’ equity minus cash and marketable 
securities, plus total debt) at the end of fiscal year t—1, scaled by sales for 
fiscal year t—1; 
Shares, = number of common shares outstanding at the end of fiscal year t; 
Litigation,, — 1 if the firm is in one of the following industries: pharmaceutical/ 
biotechnology (SIC codes 2833—2836, 8731—8734), computer (3570—3577, 
7370-7374), electronics (3600-3674), or retail (5200—5961), and 0 
otherwise; 
ImplicitClaim,, — proxied by labor intensity, calculated as 1 minus the ratio of gross PPE to 
total assets at the end of fiscal year t; 
Num. Estimate,, = number of analysts whose forecasts are included in the consensus forecast 
used to calculate earnings surprise; 
CV. AF,, = coefficient of variation (standard deviation scaled by the mean) of the 
consensus forecast used to calculate earnings surprise; and 
Down. Rev,, = 1 if at least one of the firm's analysts revised his or her forecast downward 
in the three months prior to the earnings announcement for fiscal year t, 
and 0 otherwise. 


Panel G: Factors Affecting Abnormal Accruals 


Size,, and Growth,, as measured in Panel E; NOA,,, Litigation, „ and ImplicitClaim,, as measured in 
Panel F. 
Leverage,, = long-term debt divided by total assets, both at the end of fiscal year r; 
Bonus,, = 1 for firm-years whose CEOs have nonzero bonus, and 0 otherwise; and 
Risk,, = market beta in fiscal year t estimated from weekly returns. 


APPENDIX B 
Correlations between Variables 

This appendix reports Spearman correlations between key variables used in various 
analyses. Correlations between all variables, except for those noted below, are based on the 
sample used in earnings surprise analyses, 4,301 firm-years. Correlations between net sales 
or stock returns and other variables are based on the sample used in future trading analyses, 
2,723 firm-years. Correlations between leverage or risk and other variables are based on 
the sample used in abnormal accrual analyses, 6,387 firm-years. If two variables are not 
included in the same analysis, then we do not report their correlation and use NA to denote 
such occasions. If all correlations in a column or a row are not reported, then this column 
or this row is dropped from the table. See Appendix A for variable measurement. All 
correlations, except those denoted with #, are significant at the 0.05 level or lower. 
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ABSTRACT: Because of conflicting incentives among participants, collaborations (e.g., 
strategic alliances, joint ventures, and work teams) present a slgnificant control chal- 
lenge to managerial accountants. On the one hand, formal controls such as sanctioning 
and monitoring systems Improve cooperation by reducing the Incentives for opportun- 
istic behavior. On the other hand, prior research finds that the mere presence of a 
control system causes decision makers to view the collaborative setting as non- 
cooperative, and other collaborators as untrustworthy. In this paper, we conduct two 
experiments in which participants act as business collaborators. Through these ex- 
periments, we examine the effects of control on trust and cooperation in collaborative 
settings. Specifically, we posit and provide evidence that a strong control system can 
enhance the level of trust among collaborators. The mediating role of control-Induced 
cooperation provides the mechanism by which control systems can increase trust in 
collaborative environments. Furthermore, we show that this increased trust has a pos- 
itive effect on the subsequent level of cooperation among collaborators. Taken together, 
the results suggest an Increasing marginal benefit of control system strength arising 
from the trust that control-induced cooperation engenders. The implication Is that firms 
will choose to Implement a stronger control system than previous research would seem 
to suggest. 
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I. INTRODUCTION | 

ver the, past several decades, collaborative enterprise has emerged as an important 
() business trend. Popular forms of collaboration include teams, through which com- 

panies seek gains from the synergy of employees working together toward common 
goals, as well as strategic alliances, joint ventures, and industry consortia, through which 
companies reach outside their boundaries to gain a competitive advantage. This new col- 
laborative environment presents a managerial control challenge to firms as they attempt to 
identify optimal levels of control mechanisms while simultaneously considering the impact 
of such mechanisms on interpersonal trust. Prior research finds that trust is a primary 
determinant of collaboration success (e.g., Spekman et al. 2001; Zaheer et al. 1998; Zaheer 
and Venkatraman: 1995). However, some scholars suggest that control systems reduce trust 
among collaborators. Several theories have been used to make this argument. For example, 
some scholars argue that the mere presence of a control system changes the way decision 
makers mentally frame the situation, causing them to perceive other collaborators as less 
trustworthy (e.g., Tenbrunsel and Messick 1999). Others suggest that cooperation in the 
presence of a control system is attributed to the constraints imposed by the control system 
rather than the cooperator's inherent trustworthiness, thus inhibiting the development of 
trust (e.g., Malhotra and Murnighan 2002). 

In this paper we challenge the view that control systems reduce trust. We argue that 
prior studies documenting the negative effects of control systems are limited because they 
fail to incorporate two common features of control systems. First, prior studies do not allow 
for the increased cooperation that is likely to arise from the economic incentives inherent 
in control systems. For example, Malhotra and Murnighan (2002) compare the level of trust . 
that develops when control systems are present versus absent. In that study, participants 
believed they were interacting with other participants via a computer network. However, 
these other participants were actually computers pre-programmed to always cooperate. 
Thus, Malhotra and Murnighan (2002) hold the level of collaborative cooperation constant 
across conditions. Second, prior studies do not allow for feedback regarding levels of co- 
operation (e.g., performance reports). Tenbrunsel and Messick (1999), for example, simply 
do not provide participants with feedback on the level of collaborative cooperation. Our 
study, however, allows for control-induced cooperative behavior that is observed by partic- 
ipants. Specifically, we posit and provide evidence that a control system can actually en- 
hance the level of trust among collaborators, provided the control system is strong enough 
to induce cooperation and that this cooperation is observed by the collaborators. Further, 
we show that this increased trust has a positive effect on the future level of cooperation 
among collaborators. Taken together, the results suggest an increasing marginal benefit of 
control system strength arising from the trust that control-induced cooperation engenders. 
The implication is that firms will choose to implement a stronger control system than 
previous research would suggest. 

This study is important to managers who form collaborations and to managers, man- 
agerial accountants, and internal auditors charged with establishing managerial control sys- 
tems. While firms have eagerly embraced collaboration as a key strategy, many types of 
collaborations are notoriously unstable, with a number of experts attributing their high 
failure rate to a lack of trust. This study suggests that one approach to increasing trust is 
to strengthen the control systems used to govern collaborative agreements, both in terms 
of their incentives and their feedback. That is, through increased monitoring, sanctioning, 
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and rewarding, firms can induce higher levels of cooperation. The control-induced coop- 
eration, when observed by collaborators via feedback mechanisms such as performance 
reports, will engender trust, thereby reinforcing the positive effects of the control mecha- 
nisms. Thus, this paper answers recent calls for research to provide "insights regarding 
whether explicit contracts based on managerial accounting information foster or destroy 
reciprocity and cooperation" (Sprinkle 2003, 296). 

We conduct two experiments to explore the effects of control on trust and cooperation. 
The first experiment is conducted using the traditional approach of psychology-based ex- 
perimental work, whereas the second uses an approach more representative of the experi- 
mental economics literature. We exploit the advantages of each method in order to “tri- 
angulate" the results and develop a more comprehensive understanding of the complex 
relations among control, trust, and cooperation. Thus, this study illustrates the potential for 
research that combines economics and psychology-based theory to provide insights into 
accounting issues (Haynes and Kachelmeier 1998; Merchant et al. 2003; Moser 1998; 
Waller 1995, 2001). 

In both experiments, participants assume the role of a manager involved in a collabo- 
ration. In Experiment 1, we manipulate the presence or absence of a control system and 
randomly assign participants such that inherent trustworthiness is not expected to differ 
across conditions. We find that participants cooperate more in the control system condition, 
suggesting that the control system manipulation provides incentives strong enough to induce 
cooperation. Importantly, a second set of observer participants rate the participants assigned 
to the control system condition as more trustworthy than those assigned to the no control 
system condition. Since the inherent trustworthiness is not expected to differ across con- 
ditions, it appears that the observer participants attribute the control-induced cooperation 
to the inherent trustworthiness of the managers. Thus, we find that the presence of a control 
system enhances the level of trust in a collaborative setting, and that observed cooperation 
perfectly mediates this effect. 

We corroborate and extend these findings in the second experiment, which, unlike 
Experiment 1, allows us to investigate the effect of trust on subsequent cooperative behavior. 
We again manipulate the presence or absence of a control system, randomly assigning 
participants such that inherent trustworthiness is not expected to differ across conditions. 
Participants in this experiment make decisions under explicit incentives and salient cash 
payments. Over 20 periods of play, we again find that cooperation is greater for those 
participants operating under a control system than for those operating without one. After 
20 periods, we remove the control system, such that, although the conditions vary in terms 
of the interactive histories, the incentives going forward are equivalent across the two 
experimental conditions. We find that trust is greater for those participants who had been 
operating under a control system than for those who had not. We again document that 
observed cooperation perfectly mediates the effect of the control system on trust. Further, 
we find that participants previously operating under a control system continue to be more 
cooperative than those previously operating without one. 

The remainder of the paper is organized as follows: In Section II, we review the relevant 
literature and develop the hypotheses. In Section III, we describe the experimental meth- 
odology and results for each experiment. Section IV concludes and discusses the implica- 
tions of the study. 
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IL. BACKGROUND AND HYPOTHESIS DEVELOPMENT 
The Growing Use of Collaborations 


We define a collaboration as two (or more) parties involved in some type of joint 
production, such that each party's individual output cannot be directly observed. Our def- 
inition applies generally to many different types of collaborations that are being increasingly 
used to establish competitive advantage. For example, in a survey of 700 American workers 
by Dale Carnegie Training, 90 percent of the respondents reported spending some time 
working in teams (Allerton 1996). Firms also increasingly engage in alliances with other 
firms (e.g., licensing agreements, joint ventures, industry consortia, and strategic supply 
chains). (See Anderson and Sedatole [2003] and Birnberg [1998] for a discussion of col- 
laboration types and their inherent control problems.) According to a recent survey, the 
average firm in a group of 90 multinationals formed 117 alliances over a three-year period 
(Gordon et al. 2000). In fact, many now view relational capital—the ability to attract and 
interact effectively with alliance partners—-as a core competency (Schifrin 2001). 


Risk and Control in Collaborations 


While collaborations undoubtedly provide many advantages, they are also quite unsta- 
ble (Church 1996; Das and Teng 2000; Dumaine 1994; Parkhe 1993). Collaborations are 
vulnerable to failure because they are exposed to both “performance risk" and "relational 
risk" (e.g., Das and Teng 1996, 1999, 2001). Performance risk is the probability that col- 
Jaboration objectives will not be achieved despite the full cooperation of the partners. This 
type of risk comes primarily from market forces such as competition and demand fluctu- 
ations and is, therefore, not unique to collaborations. Relational risk, on the other hand, is 
the probability that collaboration objectives will not be achieved because of a lack of 
cooperation. This risk is unique to collaborations and is, therefore, the focus in this paper. 

High levels of relational risk arise because of the social dilemma nature of collabora- 
tions. Social dilemmas are situations where opportunistic behavior leads to suboptimal out- 
comes but cooperative behavior is not individually rational (Dawes 1980). Collaborations 
are formed because of the belief that there are gains from cooperation and information 
sharing. However, the difficulty of measuring individual contributions to collaborative out- 
put creates an incentive for opportunistic behavior, as collaborators are tempted to free-ride 
and withhold information. Further, if all collaborators behave in this manner, then the 
collaboration itself is destined to fail. 

Management accountants can play an important role in mitigating relational risk 
through the design and implementation of formal control mechanisms, such as systems of 
monitoring, sanctioning, and rewarding (Seal et al. 1999). Control, broadly defined, com- 
prises the various policies and procedures that firms use to mitigate different types of risk. 
Controls mitigate relational risk by changing the incentives for opportunistic behavior. That 
is, periodic monitoring increases the probability that opportunistic behavior will be detected, 
and sanctioning (rewarding) systems impose penalties (bonuses) on collaborators who (do 
not) engage in such behavior. Prior empirical research documents the use of formal control 
mechanisms to reduce relational risk in collaborations. Ittner et al. (1999), for example, 
document benefits of increased selection and monitoring practices (e.g., frequent meetings 
and supplier certification programs) for supplier partnerships (see also, Groot and Merchant 
2000; Clement 1997). Others have documented the benefits of performance measurement 
(e.g., Meyer 1994) and surveillance (e.g., Sewell 1998) in collaborative settings. 
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From a theoretical perspective, agency theory provides insights into the optimal level 
of control for mitigating relational risk in collaborative settings.! Using a similar approach, 
Figure 1, Panel À illustrates the fundamental trade-off between the cost of control and the 
costs incurred when collaborators behave opportunistically. Specifically, we assume that the 
expected cost of opportunistic behavior (i.e., relational risk) decreases with control system 
strength. Offsetting this benefit is the cost of implementing and maintaining the control 
system, which increases with control system strength. As the figure illustrates, there is an 
implied optimal level of control for which total costs are minimized.^ Importantly, this 
optimal control level likely results in something less than full cooperation. In other words, 
the firm must assume some relational risk because complete contracts (i.e., providing full 
cooperation) are prohibitively expensive, largely due to the performance ambiguity inherent 
in collaborations. 

As implied by Figure 1, Panel À, in choosing the level of control, it is important for 
firms to understand the relation between the strength of the control system and total cost. 
Below, we explore how this relation changes when trust is added to the equation (Chenhall 
and Langfield-Smith 2003; Spekman et al. 2001; Zaheer et al. 1998; Zaheer and 
Venkatraman 1995). 


Control and Trust 


We define trustworthiness as an innate personal characteristic reflecting one's prefer- 
ence for upholding some social norm of behavior, regardless of economic incentives? In 
contrast, we consider trust to be one's perception of another's trustworthiness. Because 
collaborations are formed specifically to capture gains from synergy, it is reasonable to 
assume that cooperation is viewed as a relevant social norm in these settings. Thus, trust 
is conceptualized as the perceived likelihood that another person will cooperate, absent any 
economic incentives to do so (i.e., “I trust you to do the right thing, because you are a 
trustworthy person"). It is important to note that our definition differs from a game-theory 
perspective (e.g., Williamson 1993), which assumes that trust occurs when the economic 
incentives favor cooperative behavior (i.e., “I trust you to do the right thing, because it is 
not in your best interest to do the wrong thing"). Throughout this paper, the term trust- 
worthiness refers to an innate personal characteristic, while trust refers to an individual's 
perception of another's trustworthiness.* 

Prior research suggests that control systems can have unintended negative effects on 
trust, a finding of particular concern because of trust's critical role as a determinant of 
collaborative success (Das and Teng 1998; Jarvenpaa et al. 1998; Kale et al. 2000; Kramer 
and Tyler 1996; Landry 1998; McAllister 1995; Spagnolo 1999). If this claim is correct, 
then this presents a conundrum to the designers of managerial control systems, who must 
choose the optimal level of control, considering both the benefits (through reduced incen- 
tives for opportunistic behavior) and the costs (including a degradation of trust). 


! Specifically, this research shows that firms may make cost-saving reductions in explicit incentives (i.e., weaken 
the control system) in contracting with employees because of the supporting implicit incentives (e. g., peer 
pressure, mutual monitoring) provided by team members (Arya et al. 1997; Barron and Gjerde 1997; Che and 
Yoo 2001). Balakrishnan et al. (1998) further show a trade-off between the cost and benefits (team performance 
improvements) in choosing the strength of the control system (i.e., the informativeness of an audit). 

We do not claim to have operationalized an optimal control system, because to do so would require a specification 
of control system cost, which is outside the scope of this study. Rather, we examine how incorporating trust 
into the model shifts the optimal level of control. 

Relevant social norms may include honesty, fairness, cooperation, commitment, reciprocity, REDDE etc. 

See Kramer (1999) for other conceptions of trust in an organizational setting. 
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FIGURE 1 
Control Costs, Relational Risk, and the Optimal Level of Control 


Panel A: Fundamental Trade-Off between Control Costs and Relational Risk 
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Two main bodies of theory have been proposed to explain why control systems may 
reduce trust and cooperation in collaborative settings. The first body of theory is related to 
framing. According to this theory, decision making in social situations begins with a cat- 
egorization process. That is, the decision maker asks, “What kind of situation is this?" 
(March and Heath 1994, 58). The answer to this question depends on environmental cues. 
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The resulting categorization, or decision frame, has a powerful influence on the decision 
maker, affecting not only the perceptions of what norms are applicable, but also the ex- 
pectations of others (Messick 1999). 

Firms form collaborations specifically to capture gains from synergy and cooperation. 
Therefore, the collaborative setting itself serves as an environmental cue, and in the base 
case, decision makers are likely to frame the situation as inherently cooperative, and other 
collaborators as inherently trustworthy. Tenbrunsel and Messick (1999) suggest, however, 
that the introduction of a control system may serve as another cue, affecting how the 
situation is categorized. Specifically, they examine social dilemmas in a collaborative set- 
ting, and find that a control system invokes a business (as opposed to ethical) frame. 
Consequently, with a control system, collaborators focus on a logical and rational evaluation 
of the costs and benefits of cooperation, rather than on social norms of behavior. This frame 
increases the salience of prospects for opportunistic behavior by both the decision maker 
and others. In fact, Tenbrunsel and Messick (1999) provide evidence that both cooperation 
and trust (i.e., expectations of cooperation) are lower under a weak control system than 
under no control system.? They conclude that control systems may “have a negative influ- 
ence on perceptions, perhaps promoting a distrust of coworkers" (Tenbrunsel and Messick 
1999, 693). 

Second, Attribution Theory (Birnberg et al. 1977; Jones 1990; Kelley 1967) has been 
used to examine the effects of control systems on perceived trustworthiness and trusting 
behavior (e.g., Kruglanski 1970; Malhotra and Murnighan 2002; Strickland 1958). Attri- 
bution Theory comprises a wide body of literature aimed at understanding how people 
make causal attributions regarding, for example, the behavior of others. Generally speaking, 
behaviors can be attributed to either dispositional or situational factors. In the present 
context, when a collaborator cooperates, other collaborators may attribute the cooperation 
either to the person's innate trustworthiness (a dispositional characteristic) or to the fact 
that there is a control system in place (a situational characteristic). For example, Malhotra 
and Murnighan (2002) find that cooperation in the presence of a control system (i.e., binding 
contract) is attributed at least partially to the constraints imposed by the control system. 
On the other hand, cooperation in the absence of a control system (i.e., nonbinding contract) 
is perceived to be a reflection of a person's trustworthiness. Malhotra and Murnighan (2002) 
conclude that control systems inhibit the development of trust. 


Limitations of Prior Research 


Prior studies fail to incorporate two common features of control systems: (1) increased 
cooperation induced by the economic incentives control systems provide, and (2) feedback 
regarding that cooperation (e.g., performance reports). Because of these two exclusions, 
prior studies inhibit the positive effects of control systems. Some studies hold the level 
of collaborative cooperation constant. For example, participants in the Malhotra and 
Murnighan (2002) study believed they were interacting with other participants via a com- 
puter network. However, these other participants were actually computers pre-programmed 
to always cooperate. Thus, this study compares how trust develops when individuals observe 
others cooperating in the presence versus absence of a control system. In contrast, our study 
recognizes that effective control systems provide economic incentives for cooperation, and 


* Researchers have made similar claims in other contexts related to the more general idea that extrinsic motivation 
reduces intrinsic motivation (including trust) (Bénabou and Tirole 2003; Bohnet et al. 2001; Frey and Jegen 
2001; Pfeffer 1998a, 1998b), a concept formalized in economics as “Motivation Crowding Theory” (Frey 1997). 
For example, Yamagishi (1998) argues that Japanese subjects are less trusting than American subjects because 
contract enforcement mechanisms are more prevalent in Japan. 
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so the level of cooperation is likely to be higher when a control system is in place. Thus, 
our study incorporates the mediating role of control-induced cooperation. 

Further, some studies (e.p., Tenbrunsel and Messick 1999) overlook feedback (e.g., 
performance reports), a common component of formal control systems aimed at reducing 
relational risk. This limitation is significant, because feedback can potentially mitigate the 
negative framing effects described above. In the multiperiod collaborative setting, feedback 
regarding partner actions becomes an additional environmental cue that affects how collab- 
orators evaluate their partners and how they frame the situation in subsequent decision- 
making periods. Specifically, in viewing their partners' control-induced cooperation, col- 
laborators may favorably update their beliefs about partner trustworthiness. Kramer (1999) 
notes that perceptions of trustworthiness are largely based on such interactive histories. 

In summary, prior studies inhibit the positive effects of control by ignoring the possi- 
bility that observed control-induced cooperation can build trust. In contrast to prior work, 
we predict and show that a strong control system increases cooperation. Further, we allow 
collaborators to observe this increased cooperation (i.e., cooperation feedback). We expect 
this observation of control-induced cooperation to lead to increased trust because of a 
documented tendency of observers to overattribute others' behaviors to dispositional char- 
acteristics (Gilbert and Malone 1995; Jones and Harris 1967). This result is so widely 
observed that it is known as the Fundamental Attribution Error (Ross 1977). 

In the current study's context, when collaborators cooperate in the presence of a control 
system, observers (i.e., other collaborators) are likely to at least partially attribute this 
control-induced cooperation to the collaborators' inherent trustworthiness. Thus, the control 
system will have a positive effect on the level of trust among collaborators. Importantly, 
although collaborators are unlikely to ignore situational factors, the tendency to attribute 
cooperative behavior to dispositional factors will lead to a strong judgment of trustworthi- 
ness (a disposition). 

Observed control-induced cooperation provides a basis for the formation of trust be- 
tween collaborators. This effect is illustrated in Figure 1, Panel B, in two ways. First, the 
revised relational risk curve lies primarily below the original curve. That is, because the 
explicit controls are augmented by trust, the probability of opportunistic behavior, and 
hence, the cost associated with relational risk, decreases for a given level of control.? 
Second, the revised curve is steeper than the original curve. This reflects the notion that 
the level of cooperation is expected to increase with control strength, and so the positive 
effects of trust should also increase with control strength." In other words, the marginal 
benefit of control increases because of the trust that control-induced cooperation engenders.* 
There is a corresponding shift in the total cost curve that results in a higher optimal (i.e., 
minimum total cost) level of control. The implication is that firms will choose to implement 
a stronger control system than previous research would seem to suggest. The process by 
which we predict control to positively affect trust and cooperation is pictured in Figure 2 
and summarized by the hypotheses below. 


$ This is in contrast to the prevailing view of the effect of control on trust, which would suggest that the revised 
curve lies above the original curve. 

7 Atextreme levels of control system strength, it is theoretically conceivable that the revised curve would not be 
steeper than the original curve (i.e., the effect of trust on cooperation might not be increasing in the level of 
control). However, this condition is likely outside the norm of operations for most firms. 

8 We show the revised curve crossing above the original curve at extremely low levels of control to allow for the 
possibilities (if the control system is sufficiently weak) that (1) feedback might reduce trust (because of observed 
noncooperation), or that (2) the negative framing effects described earlier might outweigh the positive effects 
we propose. 
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Hypotheses 

As described above, we expect the level of trust to increase in control system strength. 
Prior studies have generally viewed control as a dichotomous variable (control versus no 
control) Thus, in developing and testing the hypotheses, we chose two points along the 
control strength continuum—no control system versus a strong control system. We expect 
and provide evidence for the necessary condition that the control system will lead to in- 
creased levels of cooperation (relative to no control system). We further assume that this 
control-induced cooperation will be observed by collaborators, providing a mechanism 
whereby attributions for control-induced behavior can develop. Hypotheses 1a and 1b ad- 
dress the fundamental research question of whether control systems create or destroy trust 
in collaborative settings. Our theory suggests that when collaborators observe others co- 
operating, they will at least partially attribute this behavior to the others' inherent trust- 
worthiness. Thus, while two participants may be equally trustworthy, the more cooperative 
person will be judged as more trustworthy, despite the fact that the cooperation may have 
been induced by the control system. This results in the following hypothesis. 


Hla: Participants will judge others to be more trustworthy when there is a control 
system in place (that induces cooperation) than when there is no control system. 


The mechanism by which this effect occurs is the increased, albeit control-induced, 
cooperation of participants whose trustworthiness is being assessed. That 1s, we predict that 
a strong control system will positively affect perceived trustworthiness precisely because 
of the control-induced cooperation. Thus, the prediction that control systems increase trust 
depends critically on the role of observed cooperation as a mediating variable. Accordingly, 
we make the following prediction: 


Hib: The effect of a control system on perceived trustworthiness will be mediated by 
cooperation. 


The trust that develops from observing control-induced cooperation is self-reinforcing 
in that, consistent with the prior literature, we expect trust to lead to subsequent cooperation. 
There is a large body of literature in management and psychology claiming that trust 
promotes cooperation and is, in fact, an alternate control mechanism (Bradach and Eccles 
1989; Das and Teng 1998). Most of these studies claim that trusting another individual 
promotes more cooperative behavior from the trusted individual. In this study, we argue 
further that trust promotes cooperative behavior from the trusting individual. Specifically, 
preferences for fairness, equity, and reciprocity are expected to discourage one from taking 
advantage of a trusted collaboration partner (Fehr and Falk 2002). Similarly, anticipated 
feelings of guilt are believed to dissuade one from behaving opportunistically against a 
trusted collaboration partner. 


H2: The level of cooperation will be greater for collaborators with higher perceptions 
of each other's trustworthiness. 


When viewed together, these hypotheses present a self-reinforcing effect of control on 
cooperation. That is, the marginal benefit of control increases because of the trust that 
control-induced cooperation engenders. This self-reinforcing control process provides the 
basis for our fundamental claim that, contrary to prior research findings, control systems 
can increase trust and cooperation in collaborative settings. 
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III. EXPERIMENTS 

We conducted two experiments. In both experiments we manipulated the presence or 
absence of a control system. Experiment 1 was conducted in the tradition of psychology- 
based experimental work, and allowed us to test H1a and Hlb, related to the effect of 
control on cooperation and the effect of cooperation on trust. Experiment 2, which took 
the approach of experimental economics, allowed us to verify these effects and to test H2, 
related to the effect of trust on subsequent cooperation. Each experiment involved the 
interaction of two individuals in a collaborative arrangement. The two experiments are 
described in detail below, and our operational model is pictured in Figure 2. 


Experiment 1 
Method 

A total of 82 undergraduate (sophomore and junior status) students were recruited for 
Experiment 1 from an introductory management accounting course at a large state univer- 
sity. Participants received $10 for their participation in this experiment, which involved a 
two-way between-subjects design. The independent factor is the control system (present or 
absent). Further, each observation comprises two participants, who fulfill the roles of col- 
laborator and observer. The sessions were conducted in sequence on the same day with the 
Observer sessions following the collaborator sessions. Each session lasted approximately 30 
minutes. Participants' roles are discussed below in turn. 

Participants in the role of collaborator read a scenario, in which they were instructed 
to assume the role of a research and development (R&D) manager at a large pharmaceutical 
company. As such, the participants had to decide whether to devote most of the division's 
limited resources to its individual projects or to a project being conducted jointly with 
another R&D division. The scenario specified that top management preferred that resources 
be dedicated to the joint project, and that the participant had previously committed to devote 
a high level of resources to the joint project. The scenario presented a dilemma, however, 
in that if the joint project failed, it would be difficult for top management to determine 
which division was to blame, whereas if the individual projects failed due to lack of re- 
sources, the participants would be held personally responsible. While the scenario did not 
describe an economic game with great specificity, the scenario left little doubt that the 
principled course of action would be to dedicate one's resources to the joint project because 
of the previous commitment. Further, investment in the joint project was cooperative, in 
that the joint project drew on the expertise of both divisions and was beneficial to the firm 
as a whole (and by simple extension, to the other R&D manager). Thus, social norms would 
dictate investing in the joint project, and a trustworthy individual would uphold these norms. 
However, the participants had to weigh these norms against the cost of failing on the 
individual projects. 

Those participants in the control system condition were also instructed that a consultant 
had been hired to make periodic, unannounced visits. If the consultant determined that a 
division had committed an insufficient level of resources to the joint project, top manage- 
ment would likely penalize that division in the form of reduced budgets in the future, or a 
lower bonus to the division manager. Participants in the no control system condition did 
not face the potential of consultant visits. 

In addition to choosing to dedicate high resources to individual or joint projects, each 
participant in the role of collaborator wrote an essay that described his/her thought proc- 
esses and reasoning behind the resource allocation decision. The participants in the observer 
role then reviewed these collaborator responses to assess the collaborators' trustworthiness. 
Importantly, collaborators were randomly assigned to control system conditions, such that 
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FIGURE 2 
The Effect of Control on Trust and Cooperation 
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the inherent trustworthiness of these individuals would not be expected to differ across 
conditions. | 

Each observer received the full experimental package— scenario and responses—from 
one collaborator. The observer's task was to read these materials and to make several 
judgments regarding the trustworthiness of the collaborator. More precisely, observers 
judged the collaborator's trustworthiness and cooperativeness, and the degree to which the 
collaborator was a “team player." Recall that we expect (and will verify) that participants 
in the control svstem condition will cooperate more often than participants in the no control 
system condition. The tendency of individuals to attribute behavior to dispositional char- 
acteristics suggests that this control-induced cooperation will lead to positive attributions 
of the collaborators' trustworthiness. Thus, we predict that observers will judge collabora- 
tors in the control system condition to be: more trustworthy than collaborators in the no 
control system condition (Hla) and that this. effect will be mediated by observed cooperation 
(H1b). | 
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Experiment 1 Results 

Table 1 presents descriptive statistics for Experiment 1. We asked a number of manip- 
ulation check questions to ensure that participants understood the scenarios and attended 
to the manipulations. These manipulation checks were answered correctly by substantially 
all participants, and results are inferentially identical if those participants failing manipu- 
Jation checks are eliminated.? Perhaps most importantly, we asked observer participants to 
identify (1) the control system condition, and (2) the resource allocation choice made by 
the collaborator participant (1.e., individual or joint projects). For each manipulation check, 


TABLE 1 
Experiment 1 
Descriptive Statistics 
Panel À: Collaborator Choices 
Condition 
No Control Control Difference 
System System U-stat 
Number (96) of participants investing high 5 15 102.5** 
resources in Joint Project* (24%) (75%) 
(n = 21) (n = 20) 
Panel B: Observer Judgments 
— Cono «—& ^ 
No Control Control Difference 
System System t-stat 
Mean (Std. Deviation) Ratings of Collaborators? | 
Trustworthiness 5.62 7.15 2.15** 
(1.83) (1.73) 
Team player | 4.38 6.75 2.65** 
(2.97) (2.73) 
Cooperativeness 5.43 6.70 1.76* 
(2.31) (2.30) 
(n = 21) (n = 20) 


* ** Groups significantly different at p < 0.05 and p < 0.01, respectively, one-tailed. 

* Collaborators invested a high level of resources in either the joint project or individual projects. 

b Observers rated collaborators' trustworthiness, cooperativeness, and the extent to which the collaborator was a 
team player on a 1—10 Likert scale. 


> One exception is that ten out of the 41 collaborator participants responded that the case stated they had ‘‘pre- 
viously expressed a willingness to devote a low level of resources to the joint project" when, in fact, the case 
states that the division head (their assumed role) had expressed a willingness to commit a high level of resources. 
However, these manipulation check failures appear to have been caused by a misunderstanding of the question. 
In all but one of these cases, the collaborator's actual choice was to commit high resources to the individual 
(i.e., not joint) project. Thus, in this manipulation check, these collaborator participants appear to have been 
reporting the choice they actuallv made, rather than the prior commitment described in the case scenario (the 
intended question). This anomaly does not present inferential concerns for several reasons. First, the ten collab- 
orators missing this question were fairly evenly distributed across conditions: four were in the control system 
condition, and six were in the no control system condition. Second, an examination of the writers’ essays 
provided no evidence of an actuel misunderstanding. In neither condition did any writer allude to a belief that 
he had not made a prior commitment to devote high resources to the joint project. Finally, results are inferentially 
identical if these participants are eliminated. 
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all but one observer participant correctly named the corresponding control condition and 
allocation choice, indicating that observer participants gave reasonable attention to the de- 
tails of the experiment. 

In developing our hypotheses, we take as given that participants will cooperate more 
in the presence of the control system than when there is no control system. As presented 
in Table 1, Panel A, our experimental manipulations indeed provide this necessary condi- 
tion. Recall that in this experiment, a division's choice to devote high resources to the joint 
project rather than individual projects demonstrates cooperation. In the no control system 
condition, only 24 percent (5/21) of collaborators chose to devote high resources to the 
joint project. In the control system condition, 75 percent (15/20) of collaborators devoted 
high resources to the joint project. The Mann-Whitney U-test provides statistical evidence 
that these proportions are significantly different (U = 102.5, p < 0.01, one-tailed).!'? 

Hypotheses 1a and 1b are the primary focus of Experiment 1 and predict that when 
observing control-induced cooperation, participants will not fully attribute the cooperation 
to the presence of a control system. Specifically, Hla predicts that those collaborators in 
the control system condition will be judged as more trustworthy than those in the no control 
system condition. Recall that each observer was provided with the complete experimental 
package (scenario and responses, including the essay) of one collaborator. The observers 
then answered a series of questions on their perceptions of the collaborators' characteristics. 
While individual differences in innate trustworthiness may be revealed in the essays, note 
that participants were randomly assigned to the control system versus no control system 
conditions. Thus, the inherent trustworthiness of participants is not expected to vary across 
conditions so any systematic differences in essays are the result of the control system 
manipulation. 

To test Hla, we rely on the responses of those participants in the role of observer (see 
Table 1, Panel B). The primary dependent variable is (perceived) Trustworthiness, the ob- 
servers’ Likert scale (1—10) level of agreement with the statement, “The participant is a 
trustworthy individual." A t-test finds that control system condition collaborators were 
judged by their observers to be significantly more trustworthy than were the no control 
system condition collaborators (7.15 versus 5.62; t = 2.75, p < 0.01, one-tailed). This 
finding is particularly notable because when the observers were asked to recall whether 
there was a control system in place, only one answered this question incorrectly. Thus, 
despite the fact that the observers understood the economic incentives facing the collabo- 
rators, observers at least partially attributed the collaborators' behavior to their inherent 
trustworthiness." 

Hypothesis 1b predicts that the effect of a control system on perceived trustworthiness 
will be mediated by cooperation. We conduct a mediation analysis using the three-step 
method proposed by Baron and Kenny (1986). First, we establish that the independent 
variable (Control System) has a significant effect on the mediating variable (Cooperation). 
This is accomplished using the results of the Mann-Whitney U-test reported above (and in 


? Because this scenario is not defined with economic specificity, we cannot compare the actual cooperation levels 
to any economically rationalized benchmarks. Rather, we can only provide evidence that economic incentives 
favored cooperation more in the control system condition (due to the possibility of economic sanctions if a 
consultant reported low cooperation) than in the no control system condition. As described below, Experiment 
2 provides sufficient specification to compare the cooperation levels to what would be expected if an individual 
were motivated only to maximize wealth. 

U The Hla result is corroborated with a MANOVA, in which control system (present or absent) is the independent 
factor. The dependent variables are the Likert scale (1—10) levels of agreement with three statements, “The 
participant is a trustworthy individual," "The participant is a team player," and “The participant is a cooperative 
individual.” The result is consistent with the univariate test on the ‘‘trustworthiness”’ statement. : 
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Table 1, Panel A), which finds a significant effect of the control system on cooperation (U 
— 102.5, p « 0.01, one-tailed). Second, we regress the dependent variable (Trustworthiness) 
on the independent variable (Control System). This regression is statistically equivalent to 
the t-test reported above for H1a (and in Table 1, Panel B), and finds that a control system 
has a significantly positive effect on perceived trustworthiness (t = 2.75, p < 0.01, one- 
tailed). Third, we regress the dependent variable (Trustworthiness) on both the independent 
variable (Control System) and the mediator (Cooperation). This multiple regression (not 
tabulated) finds a significant coefficient on cooperation (t = 2.05, p < 0.05, one-tailed), 
and a nonsignificant coefficient on control system (t = 1.41, p = 0.17, one-tailed). 

In sum, once we control for the effect of cooperation on perceived trustworthiness, the 
direct effect of the control system on perceived trustworthiness is no longer significant. 
Baron and Kenny (1986) point out the need to compare not only the statistical significance, 
but also the absolute size of coefficients across regressions. Such a comparison finds that 
once we control for the effect of cooperation on perceived trustworthiness, the coefficient 
on the control system becomes smaller (0.88 in the stage 3 regression versus 1.53 in the 
stage 2 regression). Together, these findings support our H1b prediction that the effect of 
a control system on perceived trustworthiness is mediated by cooperation. In fact, we find 
that cooperation perfectly mediates the relation, in that the direct effect of the control system 
on perceived trustworthiness is not significant when the effect of cooperation is controlled.!? 


Experiment 2 
Method 

While Experiment 1 was designed to explore the effect of control on trust, Experiment 
2, which provides a more precisely specified incentive structure, also provides evidence on 
the effect of trust on subsequent cooperation. Furthermore, since this experiment allowed 
participants to interact over multiple periods, participants were able to develop richer in- 
teractive histories, an important basis for making attributions of trustworthiness (Kramer 
1999). 

A total of 62 undergraduate (junior and senior status business majors enrolled in fi- 
nance, marketing, audit and tax classes) and graduate (M.B.A. students enrolled in in- 
troductory management accounting classes) students were recruited to participate in Ex- 
periment 2. Each participant was assigned to a two-person collaboration, with each 
collaboration representing an independent observation. While each person was paired with 
the same collaboration partner throughout the experimental session, these pairings were 
anonymous. Each person knew only the participant number, but not the identity of the 
person with whom s/he was paired. Communication among participants was completely 
restricted, and all decisions were made and recorded via a computer network. 

As in Experiment 1, each participant assumed the role of an R&D manager at a large 
pharmaceutical firm. For each period, s/he decided how much of the division's R&D re- 
sources to devote to a joint R&D project. There were only two choices: a high level of 
resources or a low level of resources. Participants made this decision for 30 separate pe- 
riods—earning points based on his/her decisions, and those of his/her collaboration partner 
(i.e., the other division's R&D manager). At the end of the experiment, points were tallied 
and each participant was paid $1 for every 25 points earned. 


12 Commitment Theory (Nesse 2001) alternatively suggests that a control system is a commitment signal if it is 
self-imposed. Since the control system in our study is imposed exogenously, it is unlikely to provide such a 
signal. However, the degree to which commitments are upheld is a potential alternate interpretation of observed 
cooperation, and hence of the mediating variable through which a control system may engender trust. 
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The experimental parameters are summarized in Exhibit 1. Each participant incurred a 
cost of 15 points if s/he decided to devote a high level of resources to the joint project.” 
This cost provided an incentive for the participants to devote only a low level of resources 
to the joint project. However, the two collaboration members shared the income from the 
joint project, and this income increased with the level of resources dedicated. Each partic- 
ipant's base payoff each period was 50 percent of the joint project's income minus the 
participant's resource cost. For example, if one participant selected a low level of resources 
and his/her partner selected a high level of resources, then the participant earned a base 
payoff of 15 points for that period (0.50*30 — O = 15) while the partner earned a 
base pay of zero (0.50*30 — 15 = 0). 


EXHIBIT 1 
Experiment 2 
Payoff Structure 


Panel A: Summary of Payoff Structure 


e If either collaborator devoted high resources to the joint project, that collaborator's 
division was charged 15 points. | 
e Joint project profits were increasing in the level of resources dedicated to the joint 
ject: 
e If both divisions chose low, joint project CE 
e If one division chose low and one chose igh, a project profit = 30. 
* If both divisions chose high, joint project profits = 
e Joint p Pues profits were shared y by the two divisions. 
e° Control System Condition Only-—If the consultant visited and determined that a 
division had contributed high resources, that division would be rewarded 15 POIDS 
(Probability of consultant visit = 8096 each period.) 


Panel B: No Control System Condition—Normal Form Representation Payoff Structure" 
Collaborator 2 
Collaborator 1 low high 


high mo | 


Panel C: Control System Condition—Normal Form Representation Payoff Structure” 
Collaborator 2 
Collaborator 1 low high 


"A us 


* Represents level of resources dedicated to the joint project (low or high). 

** Represents the payoff to Collaborator 1 and Collaborator 2, respectively. 

* Calculated as 50 percent of joint profit minus resource cost (15 points if the division chose high resources). . 

? Calculated as 50 percent of joint profit minus resource cost (15 points if the division chose high resources) 
plus bonus (80 percent X 15 points if the division chose high resources). 


13 This deduction is analogous to the cost (unspecified in amount) of reduced performance on individual projects 
in Experiment 1. 
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The design included one independent factor: the presence or absence of a control 
system. In the control system condition, participants were told that an auditor would make 
random and unannounced visits to each division's site. At the end of these visits the auditor 
reported to both managers the level of resources each manager contributed to the joint R&D 
projects. Thus, the control system provided feedback regarding each participant's choices 
to both participants. The probability of a visit from the auditor was 80 percent in a given 
period, and if the auditor reported that a manager had selected a high level of resources, 
this manager received a 15-point bonus.’* Again, participants were randomly assigned to 
conditions such that the inherent trustworthiness of these individuals would not be expected 
to vary across conditions. 

Without considering the effects of the control system (the periodic auditor visits and 
associated bonus), all participants faced a two-party social dilemma (i.e., a prisoner's di- 
lemma), where cooperation is a relevant social norm. Specifically, each participant had 
individual incentives to dedicate a low level of resources to the joint project, while both 
participants received a greater payoff if both contributed high resources than if both con- 
tributed low resources. However, once the control system is incorporated, the final game is 
no longer a social dilemma. That is, when there was a control system in place, it was in 
each player's individual best interest to cooperate (i.e., dedicate a high level of resources). 
Thus, we again assume that participants in the control system condition will cooperate more 
often than those in the no control system condition. 

Hypothesis 1a predicts that participants in the control system condition will be judged 
as more trustworthy than those in the no control system condition. This prediction depends 
on the ability of participants to observe each other's control-induced cooperation. Therefore 
we allow for 20 periods of play before assessing the perceived trustworthiness of each 
participant's partner. Once participants completed Period 20, the experimenter announced 
that for the final ten periods the auditor would not visit. Participants were reminded that 
because the auditor would not be visiting, they would no longer learn their collaboration 
partners' resource allocation choices. Thus, in the periods after Period 20, the incentives 
going forward are identical in the control system and no control system conditions. However, 
the conditions differ in terms of the interactive histories that have developed. 

Before playing the last ten periods, participants judged the likelihood that their collab- 
oration partners would behave cooperatively. Specifically, they answered, “How likely is it 
that your partner will select a high level of resources these final periods?" on a Likert scale 
(1-7). Recall that our definition of trust may be conceptualized as the perceived likelihood 
that another person will cooperate, absent any economic incentives to do so. Because this 
question was asked after Period 20, when the prospect of an auditor's visit was removed, 
this question operationalizes our definition of trust. ^ Hypotheses la and 1b jointly predict 
that this perceived likelihood will be greater for those participants in the control system 
condition than for those in the no control system condition. Hypothesis 2 predicts that 
subsequent cooperation will be greater for those collaborations that exhibit greater levels 
of trust. We test H2 by examining the level of cooperation in the last ten periods. 


14 Because Experiment 2 used actual monetary payments, we had to precisely define how this pay would be 
calculated. In describing the algorithm, we could have used either a bonus or a penalty scheme. We report the 
results from using a bonus scheme. However, additional experimental data were collected using a penalty scheme. 
Results from this version of the experiment are inferentially identical to those reported here. 

15 Rather than asking direct questions about the partner's trustworthiness, integrity, etc., we made this question as 
emotionally neutral as possible so that decisions in the remaining ten periods were not unintentionally affected. 
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Experiment 2 Results 

Table 2 presents descriptive statistics for Experiment 2, and the trends in cooperation 
are pictured in Figure 3. Because of the computerized nature of the experiment, basic 
manipulation check questions could be asked after the instructions but before the partici- 
pants took part in the experiment. These questions took the form of a four-question quiz, 
and all questions had to be answered correctly before the participant could begin the ex- 
periment. Therefore, all participants passed all manipulation checks. 


TABLE 2 
Experiment 2 
Descriptive Statistics 
Condition —.  —— 
No Control Control Difference 
Collaborator Choices System System t-stat 
Mean (Standard Deviation) 
Cooperation—Periods 1—20* 17.67 34.56 8.97** 
(6.41) (3.85) 
Average per period (Total/20) 0.88 1.73 
Cooperation— Periods 21—30* 7.80 10.88 1.78* 
(4.01) (5.45) 
Average per period (Total/10) 0.78 1.09 
Trust? 4.23 5.06 2:17* 
(1.21) (0.91) 
(n = 15) (n — 16) 


*,** Groups significantly different at p « 0.05 and p « 0.01, respectively, one-tailed. 

* Cooperation— Periods 1—20 is the cumulative number of times the two people in a partnership chose to 
dedicate a high level of resources to the joint project during the first 20 periods (range: 0—40). 

b Cooperation—Periods 21—30 is the cumulative number of times the two people in a partnership chose to 
dedicate a high level of resources to the joint project during the last ten periods (range: 0-20). 

* Trust is the collaborators' assessment of their partners' trustworthiness on a 1—7 Likert scale. Specifically, 
collaborators answered “How likely is it that your partner will select a high level of resources these final 
periods?" after being told that no control system would be in place for those final ten periods (periods 21—30). 
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The first purpose of Experiment 2 is to corroborate the Hla and H1b results from 
Experiment 1, using a more precisely specified economic structure. After reporting these 
results, we proceed to H2, the testing of which is the primary purpose of Experiment 2. 
We again assume (and test) that cooperation is higher for those participants in the control 
system condition than for those in the no control system condition; that is, participants in 
the control system condition will exhibit the control-induced cooperation necessary for trust 
to develop. The dependent variable for this test is Cooperation—Periods 1-20, the cumu- 
lative number of times the two people in a collaboration chose to dedicate high resources 
to the joint project in the first 20 periods. For a given period, this variable could equal 0, 
1, or 2, representing a collaboration in which neither, one, or both partners dedicated high 
resources to the joint project. Thus, the cumulative variable for all 20 periods ranges from 
O to 40.!5 In Table 2, a t-test confirms that collaborations in the control system condition 
cooperated (chose high resources) more often in the first 20 periods than teams in the no 
control system condition (34.56 versus 17.67; t — 8.97, p « 0.01 one-tailed), satisfying the 
necessary condition that the control system induces cooperation.” 

Hypothesis 1a predicts that other collaborators will positively attribute the documented 
control-induced cooperation to the collaborators' trustworthiness. Recall that in Experiment 
2, after 20 periods of play, participants in the control system condition were informed that 
the auditor would not visit again. Thus, participants in the control system and no control 
system condition faced the same “‘no control system” situation going forward in the final 
ten periods. After the announcement was made, all participants assessed the likelihood (on 
a Likert scale) that their collaboration partners would dedicate high resources to the joint 
project in the final ten periods. This variable, Trust, operationalizes our concept of trust as 
the perceived likelihood that a partner will cooperate, absent any economic incentives to 
do so. In Table 2, the responses are averaged for each pair, and a t-test confirms that 
participants in the control system condition expect a higher level of cooperation than those 
in the no control system condition (mean Trust values of 5.06 versus 4.23; t — 2.17, p 
< 0.05, one-tailed). This supports Hla, and allows us to conclude that the control system 
had a positive effect on trust. 

Hypothesis 1b predicts that the effect of a control system on trust will be mediated by 
cooperation. As for Experiment 1, we conduct a mediation analysis using the three-step 
method proposed by Baron and Kenny (1986). First, we establish that the independent 
variable (Control System) has a significant effect on the mediating variable (Cooperation). 
This is accomplished using the t-test reported above (and in Table 2), that documents a 
significant positive effect of the control system on Cooperation— Periods 1—20 (t = 8.97, 
p < 0.01). Second, we regress the dependent variable (Trust? on the independent variable 
(Control System). This regression is statistically equivalent to the t-test reported above for 
Hla (and in Table 2), and finds that a control system has a significantly positive effect on 


16 The average total collaboration resources in periods 1—20 is significantly greater than the economic prediction 
of 0 in the no control system condition (t = 10.7, p < 0.01), and significantly less than the economic prediction 
of 40 in the control system condition (t — 5.7, p « 0.01). Thus, while economic factors play a role, as evidenced 
by the significant difference in cooperation across conditions, other factors, such as individual preferences and 
social norms, are also factors in determining the level of cooperation. 

" Multiple period games such as ours often afford the opportunity for trend analysis (e.g., repeated measures 
ANOVAs). Such analysis is difficult in our setting because, for each period, the dependent variable is trichot- 
omous, rather than continuous. However, casual observation of changes in cooperation over time reveals no 
trends, largely because our control system was strong enough to próvide near perfect cooperation, even in the 
earliest periods (e.g., the average level of cooperation in the first five periods was 1.7 out of 2 in the control 
system condition). 
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Trust (t = 2.17, p < 0.05, one-tailed). Third, we regress the dependent variable (Trust) on 
both the independent variable (Control System) and the mediator (Cooperation). This mul- 
tiple regression (not tabulated) estimates a significant coefficient on Cooperation—Periods 
1—20 (t = 2.26, p < 0.01, one-tailed), and a nonsignificant coefficient on control system 
(t = 0.75, p = 0.46, one-tailed). In other words, once we control for the effect of coop- 
eration on trust, the direct effect of the control system on trust is no longer significant. 
` Further, once we control for the effect of cooperation on trust, the coefficient on the control 
system becomes smaller (0.26 in the stage 3 régression versus 8.45 in the stage 2 regres- 
sion). Thus, as is true for Experiment 1, we find that cooperation perfectly mediates the 
effect, of the control system on trust (i.e., perceived trustworthiness). l 

While the Hla, and H1b results from Experiment 1 were corroborated by the Experi- 
ment 2 results, the most significant purpose of Experiment 2 was to investigate the effect 
of trust on subsequent behavior and in the absence of the control system. Hypothesis 2 
predicts that the level of cooperation will be greater for collaborators with higher percep- 
tions of each other’s trustworthiness. The independent variable for this test is the dependent 
variable from the test of Hla, the perceived likelihood of cooperation in Periods 21—30 
when the possibility of an audit exists in neither condition (i.e., Trust). The dependent 
variable is Cooperation—Periods 21-30, measured as the cumulative number of times the 
two people in a partnership chose to dedicate a high level of resources to the joint project 
during the last ten periods. This variable ranges from 0 to 20, with 20 representing a pair 
in which both participants chose high resources in each period. À simple regression of 
Cooperation—Periods 21—30 on Trust (see Table 3) finds a significantly positive effect 
(coefficient of 2.40; t — 3.49, p « 0.01, one-tailed). Thus, H2 is supported, and we conclude 
that, upon the removal of the control system, the level of cooperation is greater for collab- 
orators with higher perceptions of each other's trustworthiness. 

The major implication of this finding is that the effect of control on cooperation is 
reinforced by the trust engendered by control. In periods 21—30, no control system is in 
place in either condition, and any difference in cooperation must be due to the level of 
trust generated by the experiences in the earlier periods. From Table 2, we find that re- 
moving the control system does lead to an overall decrease in cooperation.!? Even so, 
participants in the control system condition cooperate (dedicate high resources to the joint 
project) significantly more in the final ten periods (i.e., following removal of the control 
system) than participants in the no control system condition (10.88 versus 7.80; t — 1.78, 
p = 0.04, one-tailed). 

A trend analysis presented in Figure 3 highlights that most of the drop-off in cooper- 
ation occurred in the first few periods after Period 20, with cooperation levels hitting a low 
point of 0.9 in Period 24. This rapid decline in cooperation likely reflects a growing ap- 
preciation of the change in the incentive structure and its effect on partner cooperation. 
After this point, cooperation levels seemed to stabilize. Notably, even after the erosion in 
cooperation when the control system is removed, some residual effect of trust seems to 
remain. Specifically, cooperation in the final seven periods is higher in the control system 


'* Considering only the control system condition, the average level of cooperation was 1.7 (out of 2.0) in periods 

. 1-20, and 1.1 (out of 2.0) in periods 21—30. The Wilcoxon Signed Ranks test confirms that these proportions 
are significantly different (Z — 3.18, p « 0.01). This finding illustrates that participants not only understood 
that the control system had been removed but altered their behaviors accordingly. To ensure that this difference 
is due to the removal of the control system, rather than end-of-game effects, we confirm the comparison using 
only the two, four, six, or eight periods on either side of period 20. In all cases, we find that cooperation 
decreased signiticantly after period 20. 
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TABLE 3 
Experiment 2 
Regression of Cooperation—Periods 21—30 on Trust 


Model: Cooperation—Periods 21-30, = B, + B, Trust, + e, 


Coefficient 
Dependent Variable: Cooperation— Perlods 21—30* (t-statistic) 
Intercept — 1.788 

(—0.543) 
Trust? 2.397** 

(3.487) 

F-statistic 12.152** 
Adjusted R? 27.196 
n (collaboration scores) 31 


** Estimate is statistically significant at p « 0.01 (one-tailed) 

* Cooperation —Periods 21—30 is the cumulative number of times the two people in a partnership chose to 
dedicate a high level of resources to the joint project during the last ten periods (range: 0—20). 

> Trust is the collaborators’ assessment of their partners’ trustworthiness on a 1—7 Likert scale. Specifically, 
collaborators answered “How likely is it that your partner will select a high level of resources these final 
periods?" after being told that no control system would be in place for those final ten periods (periods 21-30). 
This Likert scale response is averaged across the two members of each collaborative partnership. 


condition than in the no control system condition at a marginally statistically significant 
level (F = 1.75, p = 0.09, one-tailed). Thus, we conclude that despite the potentially 
negative effects discussed in the prior literature, control systems can actually enhance trust, 
and this enhanced trust can increase future cooperation. 


IV. CONCLUSION 

In summary, this paper investigates the use of control systems to reduce relational risk 
and increase trust and cooperation in collaborative environments. We provide evidence that 
in collaborative settings, control induces cooperation, which, in turn, positively affects trust. 
Specifically, control systems aimed at reducing relational risk promote greater cooperation, 
which is observed by participating collaborators. This observed cooperative behavior allows 
collaborators to build trust in one another, and this trust reinforces the positive effects of 
the control system in eliciting future cooperation. Thus, the effect of control on cooperation 
is reinforced by the trust that it engenders. This finding is particularly notable, given the 
large body of literature suggesting just the opposite—that control systems reduce trust. 

The findings of this study have several implications for academic research. Prior re- 
search suggests that the presence of control systems reduces trust. However, these studies 
hold cooperation levels constant or do not provide feedback on collaborative cooperation, 
and, therefore, underestimate the potential benefits of strong control systems. Our study, on 
the other hand, provides evidence that control systems can, in fact, lead to higher levels of 
perceived trustworthiness and, ultimately, future cooperation. The mediating role of control- 
induced cooperation provides the mechanism by which this positive effect occurs. Taken 
together, the results of this study suggest an increasing marginal benefit of control system 
strength arising from the trust that observed control-induced cooperation engenders. The 
implication is that firms will choose to implement a stronger control system than previous 
research would seem to suggest. 
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The findings of this research also have implications for the practice of management 
accounting, specifically, in the design and implementation of control systems in collabo- 
rative settings. Note that the trust-building benefits of control exhibited in this paper may 
not be experienced in collaborative settings without feedback mechanisms—that is, in set- 
tings in which control-induced cooperation cannot be observed. This implies that some 
control mechanisms—namely, monitoring and mutual performance measurement—may be 
more important in collaborative settings than previously recognized. Moreover, because our 
theory suggests that the trust benefits of control accrue over time (1.e., via observation of 
control-induced cooperation), the timing of the implementation of control systems may be 
important. The earlier a strong control system is put into place, the larger are the ultimate 
trust and cooperation levels achieved. McKnight et al. (1998, 475) suggest that the "'ro- 
bustness"' of trust depends, in part, on “structural assurances" related to “the security one 
feels about a situation because of guarantees, safety nets, or other structures." By providing 
these structural assurances, strong control systems can provide for high levels of trust early 
in the collaborative relationship. 

These implications for management practice suggest that a closer examination of the 
effect of control on trust and cooperation over time is warranted. In particular, future re- 
search should examine whether the trust engendered by observed control-induced cooper- 
ation is self-sustaining. Figure 1 suggests that reductions in control necessarily lead to 
reductions in trust. However, if trust is self-sustaining, it may be possible to maintain 
cooperation levels while simultaneously reducing the strength (and, hence, the cost) of the 
control system. If true, then this would help explain the existence of successful collabora- 
tions that appear to be governed by weaker explicit control mechanisms. Furthermore, it 
would provide even more support for the notion that firms should implement stronger 
control systems in the earliest stages of a collaboration. 

One limitation of the current study is our choice to model only intra-firm collaborations 
within the experimental design—that is, we focused on teams within a firm. Because our 
theory applies to collaborations in general, and does not depend on the collaboration ex- 
isting within the boundaries of a firm, we expect our results to apply to inter-firm collab- 
orations as well. There are, however, differences between intra- and inter-firm collabora- 
tions, one of which is the cognitive predisposition of intra-firm collaborators to feel as if 
they are on the “same team.” Indeed, Towry (2003) documents that this sense of “team 
identity” increases the probability of cooperation. While we know of no theories suggesting 
that such an effect would negate the findings of the current study, we leave to future research 
an investigation of the degree to which our findings generalize to inter-firm collaborative 
settings. 

A second potential limitation of the current study is our choice to use a relatively strong 
control system. The next logical step is to identify the boundary conditions within which 
a control system will positively affect trust. For example, a weaker control system may 
provide a feedback mechanism, but fail to provide strong enough incentives for cooperation. 
In such a case, the feedback may reveal shirking and, as a result, lead to heightened levels 
of distrust among collaborative partners (see Figure 1, Panel B). More generally, a fruitful 
avenue for future research lies in providing more insight into the role of information in 
strengthening (or weakening) the links between control, trust, and cooperation identified in 
this paper. 
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ABSTRACT: We develop an empirical method that allows us to evaluate the reliability 
of an expected return proxy vla its association with realized returns even If realized 
returns are biased and nolsy measures of expected returns. We use our approach to 
examine seven accounting-based proxies that are imputed from prices and contem- 
poraneous analysts' earnings forecasts. Our results suggest that, for the entire cross- 
section of firms, these proxles are unreliable. None of them has a positive association 
with reallzed returns, even after controlling for the blas and noise in realized returns 
attributable to contemporaneous information surprises. Moreover, the simplest proxy, 
which is based on the least reasonable assumptions, contains no more measurement 

` error than the remaining proxies. These results remain even after we attempt to purge 
the proxies of their measurement error via the use of instrumental variables and group- 
ing. We provide additional evidence, however, that demonstrates that some proxies are 
reliable when the consensus long-term growth forecasts are low and/or when analysts' 
forecast accuracy Is high. 


Keywords: cost of capital; expected rate of retum; earnings forecasts; residual income 
valuation; measurement error. 


I. INTRODUCTION 


e develop an empirical approach for evaluating the reliability of estimates of the 
expected rate of return on equity capital. We use our approach to examine seven 
accounting-based proxies that are imputed from prices and contemporaneous 
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analysts’ earnings forecasts.! We show that for the entire cross-section of firms, none of 
these proxies has a positive association with realized returns after controlling for changes 
in expectations about future cash flows and future discount rates. Moreover, a naive measure 
of expected return (the inverse of price to forward earnings) contains no more, and often 
less, measurement error than the remaining proxies. We provide additional evidence, how- 
ever, that demonstrates that some proxies are reliable when the consensus long-term growth 
forecasts are low and/or when analysts' forecast accuracy is high. 

similar to the majority of studies in the empirical asset-pricing literature, our inferences 
about the reliability of a particular expected return proxy are based on its association with 
realized returns. However, unlike these studies, we assume that realized returns are biased 
and noisy measures of expected returns. This assumption is motivated by evidence presented 
in Elton (1999) and Fama and French (2002). These authors demonstrate that “information 
surprises," which cause realized returns to differ from expected returns, do not cancel out 
over time or across firms.? We show that these information surprises are also correlated 
with expected returns. Taken together, these observations imply that simple regressions of 
realized returns on expected return proxies yield spurious inferences because of omitted 
correlated variables bias. 

In light of the above, we adopt an approach that explicitly takes into account the bias 
and noise in realized returns. Our approach is based on the linear return decomposition 
developed by Vuolteenaho (2002), who demonstrates that information surprises equal the 
change in expectations about future cash flows (i.e., cash flow news) less the change in 
expectations about future discount rates (1.e., return news). Hence, he provides a theoretical 
foundation for a regression of realized returns on proxies for expected returns, cash flow 
news, and return news. The coefficients from this regression serve as our initial source of 
evidence about the reliability of the expected return proxies. 

A problem with basing our inferences on the regression coefficients discussed above 
is that the estimates of these coefficients are affected by errors in variables bias. Because 
we cannot observe expectations and changes in expectations, all of the regressors (i.e., the 
expected return proxy, the cash flow news proxy, and the return news proxy) are measured 
with error. The sign and magnitude of the bias attributable to measurement error is generally 
unknown when more than one variable in a multivariate regression contains measurement 
error (e.g., Rao 1973). However, the return decomposition demonstrates that if the com- 
ponents of realized returns are measured without error, then the estimates of the slope 
coefficients in a regression of realized returns on expected returns, cash flow news, and 
return news are unambiguously equal to 1. Thus, the bias in the coefficients corresponding 
to our empirical proxies is well defined. This, in turn, implies that we can modify the 
econometric method described in Garber and Klepper (1980) and Barth (1991) to develop 


! Examples of studies that use accounting-based expected return proxies to evaluate the cross-sectional determi- 
nants of expected returns include: Chen et al. (2004), Dhaliwal et al. (2003), Gebhardt et al. (2001), Francis et 
al. (2003), Gode and Mohanram (2003), Hail (2002), Hail and Leuz (2004), Hribar and Jenkins (2004), and Lee 
et al. (2003). We illustrate our method by examining variants of several or the expected return proxies used in 
these studies. However, our method can be used to evaluate any expected return proxy including those based 
on forecasts of dividends and prices (e.g., Botosan 1997; Botosan and Plumlee 2002; Brav et al. 2004; Francis 
et al. 2004). 

2 Elton (1999, 1199) states: “The use of average realized returns as a proxy ior expected returns relies on a belief 
that information surprises tend to cancel out over the period of a study and realized returns are therefore an 
unbiased measure of expected returns. However, I believe there is ample evidence that this belief is misplaced.” 
Fama and French (2002) provide evidence that suggests the abnormally large equity premium observed during 
the post-war era was attributable to information surprises that took the form of consistent downward revisions 
in expected future discount rates. We elaborate on these issues in Section II. 
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rankings of measurement error vartances of the expected return proxies that serve as our 
second source of evidence about their reliability. 

Our empirical results suggest that, for the entire cross-section of firms, the accounting 
based proxies we evaluate are not reliable estimates of the expected rate of return on equity 
capital. None of the proxies has a positive association with realized returns even after we 
control for changes in expectations about future cash flows and future discount rates. More- 
over, the measurement error variance of a naive proxy based on the price-to-forward- 
earnings ratio is never greater, and often lower, than the error variances of the remaining 
proxies. These results are robust. For example, we evaluate the effectiveness of two com- 
monly used methods for mitigating measurement error: instrumental variables, and group- 
ing. Neither of these approaches lead to improvements in the associations between the 
expected return proxies and realized returns, nor do they affect the ordering of the mea- 
surement error variances (i.e., the measurement error variance.of the naive proxy is no 
different from the error variances of the remaining proxies). 

Further analyses, however, demonstrate that certain proxies are reliable for some subsets 
of the data. First, we show that the reliability of the proxies is decreasing in the magnitude 
of consensus long-term earnings growth rate, and that an analog of the measure of expected 
returns used by Claus and Thomas (2001) is a reliable proxy for a subsample of firms with 
low consensus long-term growth forecasts. We also demonstrate that when ex post analysts' 
forecasts errors are low, all of the proxies have a positive association with expected returns. 
Combining these two sets of results with the fact that ex post forecast errors are increasing 
in analysts' long-term growth forecasts leads us to draw two conclusions: (1) the lack of 
reliability for the general cross-section is partially attributable to low-quality analysts' fore- 
casts, and (2) the consensus long-term earnings growth rate is a useful ex ante indicator of 
reliability (the higher the forecasted growth, rate the lower the reliability). 

We contribute to the accounting and finance literatures in three ways. First, we develop 
an empirical approach for drawing unbiased inferences about an expected return proxy 
from its association with realized returns even if realized returns are biased and noisy. 
Second, we provide evidence suggesting that in most circumstances the expected return 
proxies we evaluate are unreliable. This implies that there is a need for further research on 
the development of accounting-based measures of expected returns, and that extant evidence 
based on the proxies we evaluate should be interpreted with caution. Finally, we provide 
evidence consistent with the notion that the apparent lack of reliability of our expected 
return proxies is partially attributable to the quality of analysts' earnings forecasts, which 
suggests that further study of the determinants of analysts' forecast errors is warranted.? 

Two other studies explicitly aimed at evaluating the reliability of accounting-based 
measures of expected returns are Botosan and Plumlee (2005) and Guay et al. (2003). 
Botosan and Plumlee (2005) rank measures of expected returns by comparing coefficients 
from regressions of expected return proxies on assumed risk factors (e.g., CAPM beta, 
equity market value, leverage, etc.). While this approach has intuitive appeal, it requires 
that the researcher make the implicit assumption that the risk factors evaluated are correct 
and exhaustive, which is unlikely. As discussed in Section II, the return decomposition that 
serves as the foundation for our tests is based on a tautology: that is, our analyses are based 


? Like other studies (e.g., Abarbanell and Bushee 1997; Francis et al. 2000) that document large valuation errors 
when analysts' forecasts are used in valuation, we are unable to distinguish between two possible explanations 
for the measurement error in the expected return proxies: (1) errors in the forecasts, and (2) the restrictive 
assumptions underlying the implementations of the valuation models used to obtain the estimates of the expected 
rate of return. 
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on a fully specified model of the relation between an expected return proxy and true 
expected return. 

The approach adopted by Guay et al. (2003) is similar to ours as they also evaluate 
the relation between various expected return proxies and realized returns. There is a crucial 
difference between our study and theirs, however; we include cash flow news and return 
news proxies in our regressions, whereas they do not. In particular, the evidence presented 
in Guay et al. (2003) is based on simple regressions of realized return on the expected 
return proxy. Hence, as discussed above, the associations they document are likely to be 
biased measures of the relation between the expected return proxies and true expected 
return. In Section IV, we provide evidence that bias of this nature exists and is non-trivial. 

The remainder of the paper unfolds in the following manner. In the next two sections 
we describe our empirical method, discuss our proxies of interest, and describe our sample. 
Our main empirical results are presented in Section IV. The results of our instrumental 
variables analyses, grouping analyses, and analyses of the relation between the reliability 
of the expected return proxies and analyst long-term growth forecasts are discussed in 
Section V. We provide concluding comments in Section VI. 


Il. EMPIRICAL METHOD 

We begin by describing Vuolteenaho's (2002) linear decomposition of realized return 
into three components: expected return, cash flow news, and return news. Vuolteenaho's 
(2002) decomposition forms the basis of a linear regression of realized return on the proxies 
for its components. After discussing this regression we describe how measurement error in 
the regressors leads to bias in the regression coefficients. Finally, we describe the manner 
in which we refine the method discussed in Garber and Klepper (1980) and Barth (1991) 
so that we can isolate the portion of the coefficient bias that is solely attributable to the 
measurement error in the expected return proxy. 


The Components of Realized Returns 


Vuolteenaho (2002) demonstrates that firm 7’s realized, continuously compounded re- 
turn for year t+], r,,,, can be decomposed into three components: (1) expected return, 
€r,41, (2) changes in expectations about future cash flows (cash flow news, cn,,,,), and (3) 
changes in expectations about future discount rates (return news, rn,,,,). In particular: 


Figg CT yyy Tj Nyy 7 Miei (1) 


In Equation (1) expectations underlying er,,, are formed at the end of year t, whereas 
cn,,, and rn,,, reflect revisions in expectations occurring during year t+1.* A detailed 
description of Vuolteenaho’s (2002) return decomposition, which is similar to the well- 
known return decomposition developed by Campbell (1991), is provided in Appendix A. 
Empirical proxies for expected return, er,,,, cash flow news, cn,,,, and return news, 
7n,,,, are described in Section III. 

Three observations about Equation (1) warrant mentioning. First, Equation (1) is de- 
rived from a tautology; hence, our analyses do not rely on implicit or explicit assumptions 
about investor rationality, the nature of market equilibrium, transactions costs, etc. Second, 
the linear return decompositions developed by Campbell (1991) and Vuolteenaho (2002) 


* In Equation (1) the negative sign on return news, mi, reflects the fact thet, ceteris paribus, i increases in future 
discount rates lead to a decrease in contemporaneous price and, thus, realized return, r,,,, is lower than expected 
return, er,,,. 
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are well accepted. For example, a number of studies In finance use variations of Equation 
(1) as a means of evaluating the determinants of realized returns (comprehensive literature 
reviews can be found in Campbell et al. [1997, Chapter 7] and Cochrane [2001, Chapter 
20]). Finally, since Equation (1) reflects the effect of changes in expectations about future 
cash flows and future discount rates on realized returns, it provides a direct means of dealing 
with Elton's (1999) argument that information surprises cause realized returns to be a biased 
and noisy measure of expected returns. 

The third point is especially pertinent to our study. Bias in realized returns implies that 
the estimate of the slope coefficient taken from a simple regression of realized returns on 
expected returns is also biased. If changes in expectations about future cash flows (discount 
rates) are associated with contemporaneous expected returns, the coefficient on expected 
returns will be affected by correlated omitted variables bias. This is quite plausible. For 
example, an explanation for the equity premium puzzle is that during the post-war period 
the U.S. (and other Western nations) experienced an unprecedented run of “good luck." 5 
Hence, the expected future rate of return required by investors as compensation for holding 
the market portfolio steadily declined (i.e., economy-wide rn,,, was negative), which, per 
Equation (1), caused the realized equity premium to be consistently larger than expected.^ 
This, in turn, led to higher than expected realized returns on individual stocks. Moreover, 
the magnitude of the bias at the individual stock level was arguably increasing in the 
covariance between a stock's return and the return on the market portfolio. It follows that 
changes in expectations about future discount rates (i.e., rn,,,) were correlated with both 
realized and expected returns (i.e., ry}, and er,,,), and the coefficient on er,,, is biased if 
7n,,,, 18 omitted from the regression." 


The Regression Based on Vuolteenaho's (2002) Return Decomposition 
We begin our analyses by estimating the following regression for each expected return 
proxy: 


Turi = Qorni T Oui X CP E 00,4 X Churi T Oai X rfi, + E. (2) 


In Equation (2) ef,.,, Cris and rfi,,, represent the expected return proxy, the cash flow 
news proxy, and the proxy for negative return news (i.e., —1 X m4,,), which we refer to 
as the return news proxy. The expected return proxies, the cash flow news proxies, and the 
return news proxies are described in Section III. If these empirical proxies are measured 
without error, c,,,,, 05,,,, and a,,,, are equal to 1 and a4,,, is equal to 0. Hence, one means 
of evaluating a particular measure of expected returns is to conduct a test of the difference 
between a,,,, and 1. Unfortunately, these tests do not lead to clear-cut inferences; because 
we are unable to observe expectations or revisions in expectations, each of the regressors 
in Equation (2) contains error, which implies the bias in a particular regression coefficient 
is a complex function of the measurement errors in all of the regressors (e.g., Rao 1973). 
To circumvent this problem we use a refinement of the approach discussed in Garber and 


5 See Cochrane (2001, Chapter 21, 460—462) for a discussion of the equity premium puzzle. The discussion under 
the heading "Luck and a Lower Target” is particularly relevant. 

$ Fama and French (2002) provide specific evidence of this phenomenon. 

7 A similar argument can be made for the inclusion of cn,,, in the regression. In particular, if changes in 
expectations about future cash flows are correlated with investment opportunities, which, in turn, are correlated 
with expected returns, cn,,,, will be correlated with both realized and expected return, Berk et al. (1999) develop 
a model in which firms' optimal investment choices are associated with expected returns. 
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Klepper (1980) and Barth (1991) to isolate the portion of the bias in a,,,, that is solely 
attributable to the measurement error in ef, ,,. 


Measurement Error Analysis 


In this subsection we describe the intuition underlying the method we use to isolate 
the portion of the bias in a,,,, that is solely attributable to measurement error in ef, ,. 
Appendix B contains a rigorous description of our econometric approach, which is centered 
on the following regression: 


Vos = Soeer T O44, X hat! + Bay. X Eee + Ôa X Ee + Wier (3) 


In Equation (3) v4,,, equals the difference between firm Ps observed, realized return for 
year t+1, and the sum of the empirical measures of its components (i.&, vgs, = 7,4 
— efe — cf, — rfij,,); thus, v,,,, equals the combined measurement error in our 
empirical proxies. Each regressor (i.e., €1,.,, £2,,,, and €3,4,) essentially equals the adjusted 
error from a regression of one of the proxies on the remaining two (e.g., e4,., is obtained 
by regressing ef +; on cf,,,, and rfi,,,). Hence, each regression coefficient in Equation (3) 
measures the relation between the error in a particular proxy and the combined error in all 
the proxies. The expression for the regression coefficient corresponding to £1,,,, which we 
refer to as the “noise variable," is:° 


UP o^ (vj41) + {OW iets Vui) T OO; Yared} + (Co(eruu 1. vui) 
+ o(erz,,, Vaiti) he (4) 


In Equation (4) o?(v,,,,) is the variance of the measurement error in the expected return 
proxy, 9(v,4,,, v4,,) denotes the covariance between the measurement error in ef;,,, and 
the measurement error in cfi, ,, CVt Wut) is the covariance between the measurement 
error in ef,,,, and the measurement error in rfl,,,, o(er,.,, Uor+1) is the covariance between 
true expected return and the measurement error in cA,,,.,, and o(er,,,, v44,,) is the covari- 
ance between true expected return and the measurement error in 771,,,. If the covariance 
terms are constant across expected return proxies, then variation across proxies in the noise 
variable is solely attributable to variation in the measurement error in the expected return 
proxies (i.e., o7(v,,,4;)). This implies that the expected return proxy with the smallest noise 
variable contains the least measurement error and is the most reliable. Hence, we begin our 
measurement error analyses by estimating Equation (3) for each of the expected return 
proxies of interest. 

A problem with using the noise variables to rank our expected return proxies is that it 
is unlikely that the covariance terms shown in Equation (4) are constant across expected 
return proxies. Because each of our measures of the expected rate of return is based on a 
unique set of assumptions about dividends, future earnings growth, and terminal profita- 
bility, it is likely that the relation between the error in each proxy and the errors in the 
remaining independent variables is also unique. This implies that inferences based on the 
relative magnitudes of the different estimates of the noise variable may not provide a 


* As shown in Appendix B, the expressions for 8,,,, and 8,,,, are similar to the expressions for 8,,,,. Given that 
our primary interest relates to the measurement error in the expected return proxies, we choose to focus the 
discussion in the body of the paper on 8,,,,. 
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meaningful basis for inferring the relative magnitudes of o?(v,,,,,). To circumvent this prob- 
lem we refine the econometric approach developed by Garber and Klepper (1980) and Barth 
(1991) and estimate “modified noise variables: ? 


OTe = O(n) — [o(er,,., Chiti) + OCCT y+ 1, Tuti) 
— {OM ps1 Chur) + TVs Musi). (5) 


In Equation (5) o(er,,,, CMa.) + o(erj,,, rn,,,) equals the covariance between true ex- 
pected return and the sum of true cash flow news and true return news. 

Two observations regarding the modified noise variables are pertinent. First, while there 
is reason to believe o(er,,,, cn44,,) + o(erz,,, 71,4) iS not equal to zero, it involves only 
the true values of the constructs, which implies it is constant across proxies. Hence, if 
differences in o(v,4,,, Cn, |) + T1441, My1) are second order, which is a reasonable 
assumption, differences in the modified noise variables are primarily attributable to differ- 
ences in the measurement error variances of the proxies (i.e., o?(v,,,,)).? Second, 8ff,, is 
not a function of the measurement errors in the cash flow news and return news proxies 
(ie., Vine, and w,,,). This implies that rankings based on 9/4,, are unaffected by risk- 
related measurement errors in cf,,,, and r£, ,. 


Summary 

To summarize, bias and noise attributable to information surprises imply that simple 
regressions of realized returns on expected return proxies yield spurious inferences that are 
attributable to omitted correlated variables. Hence, we include measures of cash flow news 
and return news in our regressions (i.e., Equation (2)). However, because all of the regres- 
sors in Equation (2) contain measurement error, the regression coefficient corresponding to 
the expected return proxy is not a clear-cut indicator of the reliability of this proxy. To 
overcome this problem we use a refinement of the method described in Garber and Klepper 
(1980) and Barth (1991) to estimate the measurement error variances. 


III. EMPIRICAL PROXIES AND SAMPLE CONSTRUCTION 
Accounting-Based Measures of Expected Return 


In this section we provide a brief overview of the seven T€: return proxies we 
evaluate, each of which is imputed from prices and contemporaneous earnings forecasts. 
Our first proxy is based on the assumption that expected cum-dividend aggregate earnings 
for the next two years are valuation sufficient. Hence, it essentially equals the inverse of 
the price-to-forward-earnings ratio. For this reason we refer to it as r,,. The purpose of 
including r, in our analyses is to provide a naive benchmark (based on restrictive as- 
sumptions about future earnings growth).!! 

Our next four expected return proxies are each derived from the finite-horizon version 
of the earnings, earnings growth model developed by Ohlson and Juettner-Nauroth (2003) 
and described in Easton (2004): 


° The refinements are described in Appendix B. 

'? Two observations support tbe assumption that differences in o(vjy41, c71,,,) + TUyy+1, 704,4) across expected 
return proxies are second order: (1) there is no reason to believe errors in our ability to measure expectations 
at time t are correlated with revisions in true expectations occurring during time r+1, and (2) even if this 
correlation is non-zero, there is no reason to believe its magnitude differs across proxies. 

1! To be precise, the valuation model underlying r,, relies on the assumotion that after year +2 cum-dividend 
aggregate earnings grow at a rate equal to the cost of capital. 
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p = CPS int $ EPSu+2 T r X dps, — (1 + r) X epsay 
i r r X (r — Aagr) 


€DSi+1 AST y+) 
= AH L —— o, 
r r X (r — Aagr) (6) 


In Equation (6) P, is price at the end of year t, eps,,, is the year t forecast of year t++ 
earnings per share, dps,,, is the year t forecast of dividends paid in year f+1, r is the 
expected rate of return, and Aagr is a growth rate. Following Easton (2004), we refer to 
the difference between expected year-two cum-dividend accounting earnings (i.e., eps, 
+ r X dps,,,,) and normal accounting earnings that would be expected given earnings of 
period one (1.e., (1 + r) X eps,,,) as "abnormal growth in earnings" or agr,,,. Hence, 
Aagr equals the perpetual rate of change in abnormal growth in earnings beyond the forecast 
horizon. 

The first proxy derived from Equation (6) embeds the assumption that no dividends 
are paid in year t+1 and that Aagr equals 0. As shown in Easton (2004), this proxy is 
equal to the square root of the inverse of the PEG ratio; hence, we refer to it as r,.,.12 
Relaxing the assumption that dps,,, is equal to O yields a modified version of the PEG 
ratio and a proxy we refer to as r,,,.,..° A criticism of F eg and r,,,,, is that the assumption 
of constant abnormal growth in earnings is too restrictive. Gode and Mohanram (2003) 
avoid this criticism by assuming Aagr is a cross-sectional constant equal to the difference 
between the risk-free rate of interest and 3 percent. We refer to their proxy as r,,,. Easton 
(2004), on the other hand, simultaneously estimates r and Aagr for portfolios of stocks 
allowing for cross-sectional variation in Aagr. We refer to his proxy as Taag- 

Our final two measures of expected return are derived from the residual income valu- 
ation model. We refer to the first of these two proxies as r,, because it is based on the work 
of Claus and Thomas (2001). Claus and Thomas (2001) assume that earnings grow at the 
analysts’ consensus long-term growth rate until year z+5. They assume earnings after year 
t+5 grow at the rate of inflation, which is set equal the risk-free rate less 3 percent. The 
second proxy derived from the residual income model is based on the work of Gebhardt 
et al. (2001); hence, we refer to it as r,,,. Gebhardt et al. (2001) use actual earnings forecasts 
to develop estimates of return on equity for year t-- 1 and t+2. They assume that accounting 
return on equity linearly fades to the historical industry median between years #+3 and 
1-12, and remains constant thereafter. 

Finally, note that all of our expected return proxies reflect continuous compounding. 
In particular, the value of e?,,, for a particular set of assumptions equals In(1 + r), where 
r is the discount rate implied by the corresponding valuation model. The valuation models 
underlying each of the expected return proxies and details of their calculation are provided 
in Table 1. 


12 “The PEG ratio, which is equal to the PE ratio divided by the short-term earnings growth rate, is a common 
means of comparing stocks in analysts’ 

13 Since I/B/E/S does not provide forecasts of dividends, we assume dps,,, equals dps, (i.e., dividends per share 
paid in year f). 

14 This method simultaneously estimates the expected rate of return and the growth rate for a portfolio of firms 
that have similar PEG ratios. Assigning these estimates of the expected rate of return to each firm in the portfolio 
introduces measurement error. However, it allows us to avoid having to make ad hoc assumptions about Aagr. 

!5 While both r, and r,, are based on the residual income valuation model, as discussed above, each reflects 
different assumptions about the manner in which ROE evolves after year t-- 2. Our empirical results demonstrate 
that these differences in assumptions lead to significant differences in the statistical properties of the two proxies. 
For example, as shown in Table 3, the correlation between r,, and r,,, is less than 0.50. 
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Cash Flow News and Return News Proxies 


In the Vuolteenaho (2002) return decomposition (Equation (1)), cash flow news cn,,, 
is the component of realized returns corresponding to the change in investors' expectations 
about future cash flows. As shown in Appendix A, cash flow news is defined as follows: 


Chi, = AEs | b "X ss. (7) 


In Equation (7) AE,,,[.] equals E,,,[.] — EL], p is a number slightly less than 1, and roe 
is the natural log of 1 plus the accounting rate of return on equity. This equation implies 
that a revision in expectations about future profitability leads to a realized return that is 
larger (in magnitude) than expected return. The effect on realized return of a particular 
revision in expectations about future profitability is not, however, a cross-sectional constant. 
Rather, the size of the effect depends on investors beliefs about growth, which are captured 
by p. In particular, as shown in Appendix A, p is monotonically increasing in the price-to- 
dividend ratio, which is generally considered a function of future growth opportunities.!^ 
Hence, p can be viewed as a "capitalization factor." 
We use the following formula to estimate our cash flow news proxies: 


Cai = (roe, — Jroe,,, + TOC pat pt — froe, 1) 


p 
too X (froe i, — froe,, +). (8) 


In Equation (8) froe, , denotes forecasted roe for fiscal year k and is based on the consensus 
forecast of eps, made in December of year j, o, is the expected persistence of roe as of 
time t." The manner in which we estimate p, which varies with the price-to-dividend ratio, 
is discussed in Appendix A. Our cash flow news proxy embeds the assumption that roe 
follows a first order autoregressive process after year t--1. This assumption is consistent 
with evidence presented in Beaver (1970), Freeman et al. (1982), and Sloan (1996). 

We estimate œ, via the following pooled cross-sectional and time-series regression: 


TOE ;41 = Gg + @, X roe,,. (9) 


In Equation (9) + is a number between t and t—9 and the sample includes all firm-years in 
the same Fama and French (1997) industry with requisite data in year t—9 through year t 
(i.e., for each year t we estimate œw, using firm years between year ¢~9 and year t). Given 
that accounting methods, competition, and risk vary across industry, our use of an industry 
specific w, reduces the potential for bias attributable to risk-related measurement error in 
Chiti- 





16 Consider the dividend growth model P = en = = z: = which clearly implies the price-to-dividend 
ratio is increasing in the growth rate, g. 

"7 We include (roe, — froe, ) in Equation (8) to capture the change in expectations that occurs in year t-- 1 upon 
the announcement of actual year t earnings. As discussed below, froe,, is based on information that becomes 
publicly available on the third Thursday of the last fiscal month of year t. 
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In the Vuolteenaho (2002) return decomposition (Equation (1)), return news rn,,, is 
the component of realized returns corresponding to the change in investors' expectations 
about future discount rates. As shown in Appendix A, return news is defined as follows: 


Ti+, = AE, P X - (10) 


p has the same role in the above expression as its role in cn,,,, (i.e, it is a capitalization 
factor that allows the sensitivity of realized return to a given change in the expected discount 
rate to vary across stocks on the basis of variation in growth). We use the following formula 
to estimate our return news proxy: 


rf. = ES X (Chine — efi). (11) 


That is, rf,,., varies across valuation models, and embeds the assumption that changes in 
the discount rate are permanent (i.e., the discount rate follows a random walk). This as- 
sumption is consistent with the fact that each of our expected return proxies is the geometric 
average of all expected future discount rates. 

Our cash flow and return news proxies are a function of analysts' forecasts, which also 
serve as the basis of our expected return proxies. This is unavoidable as expectations and 
changes in expectations are inextricably linked. For example, the expected return assigned 
to a particular stock is a function of the risk of the expected payoffs; hence, cn, , reflects 
revisions in the expectations underlying er,,,. Similar logic implies that 7n,,, and er,,, 
are related, which, in turn, implies rf,,, varies across valuation models. 


Sample Construction 


Price at fiscal year-end, equity book value, dividends, and number of shares outstanding 
are obtained from the 1999 Compustat annual primary, secondary, tertiary, and full coverage 
research files. Earnings forecasts are derived from the summary 2000 I/B/E/S tape. We 
determine median forecasts from the available analysts' forecasts on the I/B/E/S file that 
is released on the third Thursday of December. We delete firms with non-December fiscal 
year-ends so that the market-implied discount rate and growth rate are estimated at the 
same point in time for each firm-year observation.'? For observations in 1995, for example, 
the December forecasts became available on the 21st day of the month. These data included 
forecasts for a fiscal year ending ten days later (i.e., December 31, 1995) and either an 
earnings forecast for each of the fiscal years ending December 31, 1996 and 1997 (i.e., 


18 [n the context of a multifactor asset-pricing model, return news is attributable to an unforeseen change in the 
risk-free rate, unforeseen changes in the factor loadings (1.e., the “betas’’), and unforeseen changes in the factor 
premiums (e.g., the market risk premium). In the instant case, a change in the risk-free rate is irrelevant because 
it has an equal effect on all equities; hence, it has no effect on the cross-sectional distribution of realized returns. 
Changes in factor loadings are relevant to the extent they vary acrass stocks. These changes occur because of 
changes in the correlation between a particular factor and stock returns, or a change in the volatility of the 
factor. Changes in factor premiums affect a particular equity by an amount proportionate to the corresponding 
factor loading for that stock. Hence, even though these changes are economy-wide, they do affect the cross- 
sectional distribution of realized returns. Changes in factor premiums occur because of changes in investors’ 
appetite for risk, ability to diversify, etc. 

19 While limiting our sample to firms with fiscal-years that end in December reduces the power of our tests, it is 
unlikely that it leads to bias; hence, our inferences are likely unaffected by this research design choice. 
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€ps,,., and eps,,,,) or the forecast for the fiscal year ending December 31, 1996 (1.e., eps,,,) 
and a forecast of growth in earnings per share for the subsequent years. When available, 
we use the actual forecasts for the subsequent year (in this example, 1997) as our proxy 
for eps,,,. When actual forecasts are unavailable we use I/B/E/S forecasts of growth in 
earnings as the basis for developing our proxy for eps,,;. Realized returns for the year 
following the earnings forecast date (in the example, January 1, 1996 to December 31, 
1996) are obtained from the Center for Research in Security Prices (CRSP) Monthly Return 
file. 

We eliminate firm-year observations (1) prior to 1981 and after 1998 because there are 
very few observations with complete data in these years, (2) with missing price, dividends 
per share, or common shares outstanding, (3) with missing or negative book value of equity, 
(4) with negative eps,,,,, (5) with eps,,. < eps,,,, (6) with a negative consensus forecast 
of long-term earnings per share growth, (7) with missing cum-dividend, continuously com- 
pounded stock return in year #+1, (8) with a missing cash flow news proxy, or (9) with a 
missing return news proxy. Finally, firm-years with values of r,,,, e£,,,, cf, ,, Or fi,,, in 
the top or bottom 1/2 percentile of each yearly, cross-sectional distribution are considered 
outliers and deleted. Our final sample consists of 15,680 firm-year observations. 


IV. MAIN EMPIRICAL RESULTS 

In this section we discuss the descriptive statistics, correlations, and the results of our 
main empirical tests. As discussed above and in Appendix B, the Vuolteenaho (2002) return 
decomposition pertains to continuously compounded returns; hence, we transform realized 
returns and each of the measures of expected return by taking the log of 1 plus the proxy 
of interest. Analyses of untransformed values lead to similar inferences. Descriptive statis- 
tics and correlations are based on the definition of r+, shown in Equation (11). The value 
of rfi,,, underlying our multivariate tests is obtained from Equation (11) and is equal to: 


p 
]—p 


— 





x (ef, zi €f, Li). 


Descriptive Statistics 


Descriptive statistics for realized returns, the expected return proxies, and the cash flow 
news proxy are shown on Panel A of Table 2. The median estimate of r,, is 8.3 percent, 
which is considerably less than the median realized return of 12.2 percent. Moreover, as 
shown in the column titled less r,, 31 percent of the firm-years in our sample have values 
of r,, below the contemporaneous value of the continuously compounded risk-free rate. 
These results suggest that r,, is a downward-biased measure of expected return. However, 
they do not provide evidence about the cross-sectional association between r,, and true 
expected returns, which is the pertinent issue given our research question.” 


20 Given the expected risk premium on a stock is rarely less than zero, a high value of less_r, suggests that the 
average of the expected return proxy is a poor measure of mean expected returns. However, less r, is not 
informative about the cross-sectional characteristics of the proxy (i.e., bias does not imply noise). Hence, in the 
context of our research question a high value of less r, is not prima facie evidence that an expected return 
proxy is unreliable. Conversely, our empirical tests relate to the cross-sectional characteristics of the expected 
return proxies and do not shed light on whether the proxies are useful measures of mean expected returns. For 
example, while our evidence suggests that variation in r,, does not capture variation in true expected returns, 
we cannot conclude that the evidence presented in Claus and Thomas (2001) regarding the magnitude of the 
expected equity premium is unreliable. 
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TABLE 2 
Descriptive Statistics 
Panel A: Realized Returns, the Expected Return Proxies, and the Cash Flow News Proxy 
Mean Std jes-r, Sth => — 25th — — 50th —— 75th 95th 
E ci 0.096 0.366 0.435 —0.551 —0.093 0.122 0.315 0.631 
Tye 0.088 0.034 0.310 0.041 0.065 0.083 0.107 0.152 
Po 0.110 0.032 0.119 0.066 0.090 0.106 0.126 0.169 
hod 0.122 0.032 0.026 0.081 0.100 0.116 0.137 0.181 
Pm 0.129 0.033 0.012 0.086 0.107 0.124 0.145 0.189 
Vises 0.128 0.029 0.005 0.094 0.110 0.122 0.138 0.178 
Ps 0.120 0.032 0.020 0.077 0.098 0.115 0.138 0.180 
T ls 0.109 0.031 0.105 0.063 0.089 0.107 0.126 0.163 
Cf, —0.026 0.384 NA —0.447 —0.151 —0.037 0.044 0.440 
Panel B: The Return News Proxies 
Mean Std 5th 25th 50th 75th 95th. 
rfi, (pe) 0.054 1.317 —].734 —0.298 —0.036 0.265 2.204 
rfi... (peg) —0.027 1.609 —2.390 —0.336 —0.034 0.270 2.352 
rf,,,, (mpeg) —0.022 1.613 —2.384 —0.359 —0.042 0.297 2.383 
rfi. ,., (gm) —0.047 1.665 —-2.461 —0.377 —0.047 0.293 2.343 
TÂ +; (Aagr) —0.023 1.448 —2.013 —0.291 —0.036 0.233 2.043 
TA», (ct) —0.076 1.259 —2.104 —0.359 —0.064 0.236 1.909 
rfi, ., (gls) 0.007 0.928 —1.407 —0.274 —0.037 0.233 1.609 


r,44 18 the realized, continuously compounded return for year t+1. r,, is the expected return estimate imputed 
from the price to forward earnings model. r, is the expected rate of return implied by the PEG ratio. ruses is 
the expected rate of return implied by the modified PEG ratio. r__, and rA; are the expected return estimates 
imputed from Gode and Mohanram’s (2003) and Easton's (2004) implementation of the Ohlson and uettner- 
Nauroth (2003) model, respectively. r, and r,,, are the ex return estimates imputed from Claus and 
Thomas's (2001) and Gebhardt et al.'s (2001) implementation of the residual income valuation model, 
respectively. All the expected return proxies represent continuously compounded returns. cfi,,, is the cash flow 
news proxy. rfi,,, (xxx) is the return news proxy for model xxx. See Table 1 for further details. Mean, Std, 5th, 
25th, 50th, 75th, and 95th represent the mean, standard deviation, 5th percentile, 25th percentile, median, 75th 
percentile, and 95th percentile, respectively. less..r, is the proportion of firm-years in which the return variable 
has a value less than the natural log of 1 plus the contemporaneous risk-free rate of return. 


The adjustment provided by taking short-term earnings growth into account (r,..) causes 
the median estimate of expected returns to increase to 10.6 percent and inclusion of divi- 
dends in the expected payoff (r,,..) causes the median to rise to 11.6 percent. The estimation 
procedure used by Gode and Mohanram (2003) relies on the assumption that abnormal 
growth in earnings increases after year t+1 and the method used by Easton (2004) leads 
to estimates of increases in abnormal growth in earnings that are, on average, positive. 
Hence, the median value of r,,, (12.4 percent) and r,,,, (12.2 percent) exceeds the median 
value of r,,,,,. The medians of r,, and f, which are based on the residual income valuation 
model, are 11.5 percent and 10.7 percent, respectively. 

The median value of the cash flow news proxy (cfi,,,) is —3.7 percent.?! To the extent 
that this measure of cash flow news is attributable to analyst optimism (e.g., Richardson et 


?! The median (mean) value of o, used in the calculation of c+; is 0.52 (0.55). 
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al. 2001) that is ignored by the market, the negative median value of cash flow news 
represents measurement error in our cash flow news proxies. Moreover, given that our 
expected return proxies and return news proxies are also derived from analysts' forecasts, 
the negative median value of cash flow news implies that these constructs are also measured 
with error. 

Descriptive statistics for our return news proxies are shown in Panel B of Table 2. 
Since these estimates equal the change in the estimate of expected return over the realized 
return interval, they differ across the various estimates of expected returns. The median 
estimates of return news are consistently negative. For example, the median return news 
implied by the change in r,,, is —3.4 percent and the median return news for r,, is —6.4 
percent. In light of the fact that prices rose during our sample period, the decline in our 
expected return proxies suggests a coincident decline in the equity premium. 


Correlations 


Table 3, Panel A summarizes the correlations among realized returns, the expected 
return proxies, and the estimates of cash flow news. Pearson product moment (Spearman 
rank order) correlations are shown above (below) the diagonal. Correlations between our 
return news proxies and the remaining variables of interest are shown in Panel B. The 
correlations are the temporal averages of the annual cross-sectional correlations. The 
t-statistics are the ratio of these averages to their temporal standard errors. We focus our 
discussion on the Spearman correlations. The Pearson correlations lead to similar 
inferences. _ 

There is a significant positive correlation between realized return and two of the ex- 
pected return proxies: r,, (0.073, t-statistic of 2.35) and r,,, (0.061, t-statistic of 1.95). None 
of the correlations between the remaining expected return proxies and realized returns is 
statistically different from zero at the 0.05 level. Moreover, three of the correlations are 
negative. These negative correlations imply that the cash flow news and return news com- 
ponents of realized returns reflect more than random measurement error, which only causes 
attenuation bias and has no affect the sign of the correlation.” Rather, these correlations 
support the arguments we made in Section II. Specifically, true cash flow news and true 
return news are correlated with our expected return proxies. Hence, in order to avoid draw- 
ing spurious inferences attributable to omitted correlated variables bias it is crucial that we 
control for variation in cash flow news and returns news. 

As expected, the correlation between realized returns and the estimate of cash flow 
news is positive and significant (Spearman correlation of 0.314 with a t-statistic of 16.99). 
The significant negative correlation between the cash flow news proxy and r,, and the 
significant negative correlations between the cash flow news proxy and each of the estimates 
of expected returns derived from the abnormal growth in earnings model (i.e., Equation 
(6)) provide additional support for our argument that including a cash flow news proxy in 
our regressions is necessary in order to avoid correlated omitted variables bias. These 
negative correlations suggest that firms with relatively high discount rates experienced larger 


2 The correlation between realized return (r,,, © er,,, + cn4,, — rn,,,) and an expected return proxy éf,,,, 


equals Naia e SOAs) ines leet) ae ce ig news and Taka yaa simply random mea- 
OT ys) X Eusi) 

surement error, then the second term in the numerator equals zero and the sign is equal to the sign of the first 

term. The sign of the first term is positive unless the covariance between true expected return and the measure- 

ment error in the expected return proxy is negative and has an absolute value greater than the variance of true 

expected returns. This is unlikely. 
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TABLE 3 
Correlations among Key Variables 


Easton and Monahan 


Panel A: Realized Returns, the Expected Return Proxies, and the Cash Flow News Proxy 


Cfi, 


Ti 


0.056 
(1.31) 
—0.028 
(—1.13) 
0.023 
(0.85) 
—0.003 
(—0.13) 
0.015 
(—0.68) 
0.073 
(2.35) 
0.061 
(1.95) 
0.314 
(16.99) 


r 


a Bf... Fe 


0.048 
(1.16) 


0.386 
(19.48) 


0.577 
(18.4]) 


0.351 
(13.87) 


0.360 
(19.89) 
0.713 
(40.22) 
0.605 
(38.42) 
—0.030 
(— 1.98) 


r 


UE 


—0.032 
(—1.40) 
0.384 
(18.98) 


0.860 
(38.84) 


0.843 
(31.81) 


0.910 
(101.45) 


0.620 
(37.09) 


0.471 
(28.48) 


«0:222 


(—20.03) 


r 


| SR 


0.013 
(0.54) 
0.537 
(15.73) 


0.892 
(44.39) 


0.936 
(47.21) 
0.807 
(38.93) 
0.724 
(59.88) 
0.451 
(25.16) 
—0.178 


(—14.93) 


Imo o tage 
—0.006 — —0.010 
(-0.34) (—0.52) 
0.339 0.325 
(14.93) (17.56) 
0.880 0.866 
(37.33) (26.11) 
0.956 0.817 
(60.30) (20.10) 
0.811 
(18.89) 

0.795 

(32.16) 
0.610 0.573 
(28.21) (38.55) 
0317 0.418 
(12.12) (24.07) 
—0.191  —0210 
(—13.53) (—19.73) 


La 


0.050 
(1.68) 
0.713 
(38.29) 
0.591 
(33.64) 
0.664 
(39.26) 
0.567 
(30.02) 
0.498 
(20.54) 


0.470 
(17.86) 
—0.113 

(—6.16) 


EL AN 
0.057 
(2.03) 
0.599 
(33.13) 
0.415 
(23.05) 
0.394 
(19.52) 
0.278 
(11.25) 
0.331 
(14.84) 
0.428 
(14.13) 


0.068 
(3.43) 


ch, , 


0.243 
(13.71) 
—0.039 

(—2.77) 
—0.141 


(—12.44) 


—0.132 


(—12.94) 


—0.134 


(—13.31) 


—0.129 


(—11.89) 


—0.088 
(—4.55) 
0.148 
(4.11) 


Panel B: Correlations among Return News Proxles, Realized Returns, the Expected Return 
Proxies, and the Cash Flow News Proxy 


Pearson Product Moment Spearman Rank Order 

Model Fus CP nat Chy, T1 CF nat cfl, 
rfi, (pe) —0.339 —0.245 0.055 —0.429 —0.291 0.125 
(—8.97) (—7.79) (2.89) (—14.74) (—6.49) (4.81) 
rf. (peg) —0.174 —0.338 0.031 —0.200 —0.375 0.123 
(—5.36) (—10.77) (1.77) (—7.17) (— 12.00) (4.49) 
rfi,,, (mpeg) —0.198 —0.343 0.026 —0.264 —0.387 0.090 
(—6.36) (—15.79) (1.48) (—10.55) (—18.10) (3.84) 
Ai, (gm) —0.153 —0.356 0.018 —0.214 —0.402. 0.075 
(—5.29) (—14.91) (1.10) (—9.17) (—17.71) (3.69) 
rfi, ., (Aagr) —0.164 —0.414 0.018 —0.174 —0.434 0.097 
(—5.92) (—11.64) (1.01) (—7.38) (—16.39) (4.17) 
TAs, (ct) —0.134 —0.286 0.126 —0.223 —0.323 0.251 
(—3.63) (—11.44) (8.21) (—6.58) (—10.27) (10.22) 
ria (gls) —0.444 —0.240 0.024 —0.475 ~0.272 —0.001 
(— 12.52) (—9.09) (1.44) (— 17.03) (—7.55) (—0.03) 


All correlations reflect averages of the annual correlations. t-statistics (in parentheses) equal the ratio of the 
correlation to its temporal standard error. Cells above (below) the diagonal of the correlation matrix shown in 
Panel À correspond to Pearson product moment (Spearman rank order) correlations. The rows in Panel B are the 
correlations between the measure of return news for a particular model and realized returns, expected returns as 
measured via the model of interest, and the cash flow news proxy. r,,, is the realized, continuously compounded 
return for year t-- 1. r,, is the expected return estimate imputed from the price to forward earnings model. F e is 
the expected rate of return implied by the PEG ratio. r,,.. is the expected rate of return implied by the modified 
PEG ratio. r,,, and Fang are the expected return estimates imputed from Gode and Mohanram’s (2003) and 
Easton’s (2064) implementation of the Ohlson and Juettner-Nauroth (2003) model, ively. r.,and r,,, are 
the expected return estimates imputed from Claus and Thomas's (2001) and Gebhardt et al.'s (2001) 
implementation of the residual income valuation model, respectively. All the expected return proxies represent 
continuously compounded returns. cfi,,, is the cash flow news proxy. rf,,,(xxx) is the return news proxy for 
model xxx. 

See Table 1 for further details. 
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than iab negative information surprises about future cash flows during the time period 
under study.? 

The correlation between r,,, and the cash flow news proxy is positive and significant, 
however (Spearman correlation of 0.068 with a t-statistic of 3.43). A rationale for this 
phenomenon is as follows. There are two essential differences between r,, and the remain- 
ing expected return proxies: (1) the residual income valuation model is anchored on year f 
equity book value, and (2) the terminal value correction used by Gebhardt et al. (2001) is 
a function of historical industry median return on equity (ROE). Thus, compared to the 
remaining estimates of expected returns, the amount of cross-sectional variation in r 
attributable to variation in the firm-specific book-to-market ratio and variation in industry 
ROE is relatively high. These facts are a likely explanation for the positive correlation 
between r,,, and the estimate of cash flow news: variation in our cash flow news proxy is 
also a function of variation in equity book value and it is possible that analysts tend to 
revise their optimistic forecasts of ROE toward the industry ROE.24 

Table 3, Panel B summarizes the Pearson and Spearman correlations among our return 
news proxies and realized return, the cash flow news proxy, and the corresponding estimates 
of expected return. As expected, the correlation between each of the estimates of return 
news and realized returns are significantly less than zero (all the t-statistics are less than 
—3.5). The correlations between the return news proxies and the corresponding proxies for 
expected returns are all negative and statistically significant. These results are consistent 
with a decline in the equity premium during our sample period. As discussed in Section 
IL, if the equity premium fell during the sample period, then stocks with high betas (and, 
thus, high expected returns) Peu larger than average downward revisions in ex- 
pected future discount rates (ie., rn,,,).? Hence, in order to avoid drawing spurious in- 
ferences attributable to omitted variables bias, it is crucial we include a proxy for return 
news in our regressions. 


Multivariate Analyses 


Results pertaining to our multivariate tests are shown in Tables 4 and 5. The regression 
coefficients shown in these tables are the temporal averages of the regression coefficients 
obtained from annual cross-sectional regressions. The t-statistics are computed via the ap- 
proach described in Fama and MacBeth (1973). 

Summary statistics from the estimation of the regression of realized returns on the 
empirical measures of its components (expected return, cash flow news, and return news)— 
Equation (2)—are presented in Table 4 for each of the expected return proxies. The esti- 
mates of the coefficient a, on the expected return proxies are negative for each of the 
regressions except the regression based on r,,, and this estimate is not statistically different 
from zero. Moreover, untabulated results demonstrate that all of the estimates of a, are 


25 These correlations are also consistent with the notion that analysts’ forecasts are too extreme in the sense that 
high forecasts are too high and low forecasts are too low. For example, if year zero analysts' forecasts are too 
high (too low), the implied discount rate will be biased upward (downward) and analysts' forecast errors and 
revisions will be negative (positive). 

24 We test this conjecture by estimating annual regressions of each expected return proxy on the contemporancous 
book-to-market ratio and historical industry ROE. The average, untabulated R?s from these regressions are: ru 
0.135, r, 0.027, r, 0.045, Tom 0.029, Taag 0.026, r, 0.05, and r,,, 0.225. These results support our conjecture 
for two reasons: (1) the highest average ? is obtained in the regression involving r,,, and (2) the average 
R? is higher when the correlation between the expected return proxy and the cash flow news proxy is higher. 

25 Alternatively, these negative correlations may be attributable to measurement error in our proxies. For example, 
a decline in analyst optimism may have occurred during our sample period, or extremely high (low) forecasts 
may be followed by larger than average downward (upward) revisions. 
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TABLE 4 
Multivariate Analyses for Annual Portfolios 


Regression: r,,, = Ooi T Ogei X Piggy + 05,4 X Chiusi F oua X Marr t Ensi 











efi, Qo BL e Qs R? 
P 0.16 —0.54 0.29 0.10 0.22 
(5.33) (—1.46) (5.28) (10.36) 

Pos 0.19 —0.67 0.26 0.04 0.12 
(6.26) (—2.81) (5.59) (6.43) 

P on 0.15 —0.36 0.27 0.05 0.13 
(4.02) (1.20) (5.67) (7.11) 
5s 0.16 —0.40 0.26 0.04 0.11 
(4.90) (—1.75) (5.67) (5.98) 

Tiagi 0.20 —0.70 0.26 0.04 0.11 
(6.16) (—2.70) (5.70) (6.41) 
rs 0.09 0.10 0.30 0.05 0.13 
(1.98) (0.25) (5.09) (4.29) 

T sts 0.24 —1.15 0.29 0.19 0.30 
(7.91) (—4.21) (5.52) (13.52) 


Separate regressions are estimated for each of the 18 annual cross-sections of data. Parameter estimates equal the 
average of the annual regression coefficients. t-statistics (in parentheses) equal the ratio of the parameter 
estimates to their temporal standard errors. R^ is the mean R? from the annual regressions. ef,,,, Chhysis Must, 
and r,,, represent the expected return proxy, the cash flow news proxy, the return news proxy and the 
continuously compounded return for year t-1. r,, is the expected return estimate imputed from the price to 
forward earnings model. r is the expected rate of return implied by the PEG ratio. r,.,, is the expected rate of 
return implied by the modified PEG ratio. rp and Fass; ALE the expected return estimates imputed from Gode and 
Mohanram's (2003) and Easton's (2004) implementation of the Ohlson and Juetmer-Nauroth (2003) model, 
respectively. rand r,,, are the expected return estimates imputed from Claus and Thomas’s (2001) and Gebhardt 
et al.’s (2001) “implementation of the residual income valuation model, respectively. All the expected return 
proxies represent continuously compounded returns. 

See Table 1 for further details. 


significantly less than 1 at the 0.01 level. These results suggest there is a considerable 
amount of measurement error in each of the expected return proxies. On the other hand, 
as discussed in Section II and Appendix B, the observed bias in a, may be attributable to 
other factors; hence, the need for additional analyses. As expected, the estimates of the 
coefficients on cf,,,, and rf,,, are significantly positive. For example, for the regression 
based on r,,,, the estimate of the coefficient on cA,,, is 0.29 (t-statistic of 5.52) and the 
estimate of the coefficient on rfi,,, is 0.19 (t-statistic of 13.52). Nonetheless, untabulated 
results demonstrate that these estimates are also significantly less than one, which implies 
that these constructs are also measured with error. 

The results of estimating Equation (3) for each of our return proxies are shown in Table 
5. Before proceeding with a discussion of the regression coefficients it is interesting to note 
that Equation (3) explains a considerable portion of the cross-sectional variation in the 
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TABLE 5 
Multivariate Analyses for Annual Portfolios 


Regressions: von = Soti + Š, X finer T Sorsi X £a T Š X Eja T Fai 
Vei = Sori t Bitar X Einsi + Soper X Bier TF By X Ser T uei 


Unmodified Noise Variables Modified Noise Variables 
(v, is the regressand) ; (u, 1 is the regressand) 

Model 5, Š, 8, à, R? 6 o" à, Š R 
pe 0.161 0.0076 0132 1.214 094 0.161 0.0005 0.132 1214 0.94 
(5.33) (9.42) (3.65) (6.36) (5.33) (1.42) (3.65) (6.36) 
peg 0.189 0.0152 0.136 2.098 0.96 0.189 0.0012 0.136 2.098 0.96 
(6.26) (7.70) (4.01) (7.89) (6.26) (5.92) (4.01) (7.89) 
mpeg 0.153 0.0142 0.139 2.093 0.96 0.153 0.0008 0.139 2.093 0.96 
(4.02) (8.07) (4.03) (7.85) (4.02) (3.76) (4.03) (7:85) 
gm 0.162 0.0161 0.142 2.284 0.96 0.162 0.0010 0.142 2.284 0.96 
(4.90) (7.99) (4.08) (8.32) (4.90) (5.31) (4.08) (8.32) 

Aagr 0.203 0.0144 0.143 1.712 0.95 0203 0.0008 0.143 1.712 0.96 
(6.16) (8.02) (4.30) (8.71) (6.16) (5.02) (4.30) (8.71) 
ct 0087 0.0070 0.106 1.130 0.93 0.087 0.0003 0106 1:130 0.93 
(1.98) (9.48) (2.91) (7.30) (1.98) (1.25) (2.91) (7.30) 
gis 0239 0.0081 0.150 0552 0.90 0239 0.0004 0.150 0.5532 0.91 
(7.91) (9.28) (4.00) (7.77) (7.991) (1.61) (4.00 (7.77) 


Separate regressions are estimated for each of the 18 annual cross-sections of data. Parameter estimates equal the 
average of the annual regression coefficients. t-statistics (in parentheses) equal the ratio of the parameter 
estimates to their temporal standard errors. R? is the mean R? from the annual regressions. 
Versi = Tusi T fear — Chari 7 yy), Where efi, Cusi Minti and r ,,, represent the expected return proxy, 

the cash flow news proxy, the return news proxy and the realized, continuously compounded return for year t+1. 


All the expected return proxies represent continuously compounded returns. See Table 1 for further details. 


Vgl = Vong: — {OCC gai, Mary) + O(ef,,, rf i) X Chest 


cA.) = Ele Bios: a it — Poner and _ SC 7 Pomi 
iis 07 (£141) EE O7(Can +1) Kas THEa). 


OCP nats iy) (Olfa Mya) is the covariance between the expected return proxy and the cash flow news 
(return news) proxy. The B, g, and oe) terms shown above are taken fram the following first-stage regressions: 


efi = Bia + Binar X Chyi + Bizi X Ausit Einsi 
Chari = Pagi + Baa X ef, + Bouri X Mar + £3, 
yai = Boasi + Basiai X Efi + Bansi X Amar T Eus 


The regression coefficients relating to the measurement error variance of ef,,, are: 


biri 7 Oe) + {00ra Var) T Ou Ved) + {Erur vua) + Orga esi] 
Šf = o (uua) — (o(eru s, Myt) + OCCT gts Madd} — [GG n n5) + Oops My41)} 


97(vj4,,) is the measurement error variance of the expected return proxy, o(vj,,,, u3441) (Orti 03444)) is the 
covariance between the measure error in the expected return proxy and the measurement error in the cash flow 
news (return news) proxy, o(er,,,, Vries) (OCCT p41 v4441)) 18 the covariance between true expected returns and 

the measurement error in the cash flow news (return news) proxy, o(er,,,, Chust) (Oer upo 71,,4)) is the 
covariance between true expected returns and true cash flow news (return news), and o(u,,, p c14,,) (Oirn 

™M,,,,)) is the covariance between the measurement error in the expected return proxy and true cash flow news 
(return news). 
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combined measurement error in our estimates of expected returns (¿.e., v, ,).7? For ex- 
ample, the R* based on r,, is 0.90. This implies the return decomposition developed by 
Vuolteenaho (2002) provides a useful characterization of the components of realized returns. 

Turning to the regression coefficients, two implications are immediate. First, all of the 
expected return proxies contain statistically significant measurement error. For example, the 
proxy with the highest measurement error variance is r,,, (9, of 0.0161, t-statistic of 7.99). 
Second, while r,, contains the least measurement error (ë, of 0.007, t-statistic of 9.48), the 
simplest proxy, r,,, has only a slightly higher measurement error variance (8, of 0.0076, t- 
statistic of 9.42). Moreover, untabulated results demonstrate that the measurement error in 
r, is not significantly greater than the measurement error in r,, or any of the remaining 
proxies.”’ 

Inferences based on variation in 8, are predicated on the assumption that the correlations 
between the measurement error in our expected return proxies and the measurement error 
in the remaining proxies are the same for each of the expected return proxies. As discussed 
in Section II this assumption may not be valid. Thus, we evaluate the modified noise 
variables. The results of these analyses are summarized on the right-hand side of Table 5. 

The expected return proxy with the lowest estimate of 8,” is r,, (0.0003, t-statistic of 
1.25). However, the coefficient on r., is only slightly less than the estimate of 5,” pertaining 
to r,, (0.0005, t-statistic of 1.42), and the difference between these two coefficients is 
statistically insignificant (untabulated t-statistic of 0.99). Moreover, the coefficient on r, is 
smaller than each of the coefficients on the remaining proxies except r,,,, and the coefficient 
on r;,, is not statistically different from the coefficient on r,.. 


Sensitivity Analyses 

The results presented in Tables 4 and 5 are striking. None of the expected return proxies 
we evaluate has a statistically positive association with realized return even though we 
control for information surprises attributable to changes in expectations about future cash 
flows and future discount rates. Further, none of the proxies has less measurement error 
than the simplest proxy, r,,, which is based on a restrictive set of assumptions about future 
growth and profitability. Taken together these results suggest the proxies we evaluate are 
unreliable. Given the provocative nature of this conclusion, we discuss the (untabulated) 
results of three sets of robustness checks. 

In our first set of robustness checks we evaluate the sensitivity of our results to as- 
sumptions about p. The results shown in Tables 4 and 5 are based on the assumption that 
p varies with the price-to-dividend ratio; however, it is possible that p varies for other risk- 
related reasons. To alleviate this concern we re-estimate the regressions underlying Tables 
4 and 5 using values of p that vary with: (1) size (equity market value), and (2) the debt- 
to-equity ratio." Results based on these robustness checks are similar to the results shown 


26 The untabulated means (medians) of v,,,, for the expected return proxies are: r,, 0.09 (0.03), r, —0.01 (0.01), 
r, | —0-02 (—0.02), r, —9.05 (—0.03), Faser —0.03 (—0.01), r, —0.07 (—0.03), and r,,, 0.02 (0.01). These 
amounts suggest that, for several of the expected return proxies, the average (median) combined measurement 
error is nontrivial. They are not, however, informative about the cross-sectional characteristics of the proxies as 
bias does not imply noise. 

? The measurement error in r,, is significantly lower than the measurement error in all of the expected return 
proxies derived from the abnormal growth in earnings model (i.e., Equation (6)). To illustrate how we test the 
difference between 8, for two expected return proxies, consider r,, and r,,,. We begin by calculating annual 
differences between the value of 5, pertaining to r,, and the value of 8, pertaining to r,,,. Next, we calculate 
the mean and standard error of these annual differences, and form t-statistics by taking the ratio of these two 
numbers. 

7 We use an approach similar to the approach shown in Appendix B to estimate different values of p for size 


(debt-to-equity) quintiles. 
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in Tables 4 and 5. We also obtain similar results to those shown in Tables 4 and 5 when, 
as per Vuolteenaho (2002), we assume p is equal to 0.967 for all firm-year observations. 

In our second set of robustness checks we evaluate whether our results are attributable 
to extreme observations. We begin by re-estimating Equation (2) via rank regressions, which 
yield inferences similar to those based on the results shown on Table 4.2 Rank regressions 
cannot be used to estimate 8, and 8,; hence, to evaluate the sensitivity of our estimates 
of the measurement error in each proxy we re-estimate 8, and 8,” after deleting firm-years 
with values of r,,,, e£,,,, CA, +1, Or rfi, in the top or bottom fifth percentile of the annual 
cross-sectional distribution.?? This modification does not change our inferences. 

Finally, we evaluate the robustness of our Fama and MacBeth (1973) t-statistics by 
calculating t-statistics on precision weighted regression coefficients. Specifically, we divide 
each annual coefficient by its contemporaneous standard error, and calculate t-statistics by 
dividing the mean of the precision weighted coefficients by their temporal standard errors. 
These t-statistics are similar to those shown on Tables 4 and 5. 


V. INSTRUMENTAL VARIABLES, GROUPING, AND LONG-TERM 
GROWTH PARTITIONS 

Two common approaches for dealing with measurement error are: (1) instrumental 
variables, and (2) grouping. In this section we examine the effect these methods have on 
the reliability of the expected return proxies. Our motivation for this analysis is two-fold. 
First, if instrumental variables or grouping mitigates the measurement error in the expected 
return proxies, business practitioners and researchers can use these methods as the basis 
for developing estimates of expected return. Second, these methods provide an additional 
means of evaluating the robustness of the results discussed in Section IV. In particular, we 
demonstrate that for each expected return proxy the estimate of o, from Equation (2) is 
not statistically positive even after we attempt to purge the proxy of its measurement error. 
In addition, r,, continues to perform as well as less simple proxies.*! 

In this section we also examine how our results vary with the magnitude of analysts' 
forecasts of long-term growth and with the magnitude of analysts' ex post forecast errors. 
Consistent with La Porta (1996) and Frankel and Lee (1998) we demonstrate a positive 
association between analysts’ forecasts of the long-term earnings growth rate, /g,, and errors 
in analysts' forecasts of earnings. Next, by combining evidence about the variation in o, 
and 5, across partitions of the data formed on the basis of /tg, with evidence about the 
variation in e, and 8, across partitions of the data formed on the basis of ex post analysts’ 
forecast errors, we demonstrate that: (1) the results documented in Section IV are attrib- 
utable to errors in the earnings forecasts underlying our estimates of expected return, and 
(2) itg, is a useful ex ante indicator of forecast quality. This evidence is useful for three 
reasons. First, it sheds light on potential future research opportunities (e.g., development of 
better earnings forecast models or statistical approaches to purge the error from analysts' 


? Ranks are determined separately for each annual cross-section and the t-statistics pertaining to the rank regres- 
sions are estimated via the Fama and MacBeth (1973) procedure. 

3 Valid comparisons of 8, (8,4) across expected return proxies can only be drawn if the same set of observations 
underlies each estimate of 8, (8,5). Hence, for the purposes of this robustness check we discard observations 
with values of ef,,, or "ah in the top or bottom fifth percentile for any of the expected return proxies when 
estimating 8, (875) for a particular expected return proxy. This leads to a considerably smaller sample size of 
8,748 firm-year observations. 

^ We also evaluate a proxy, Tap which equals the average of r,,, pep Pope ‘se Taagn Teo aNd r, Untabulated 

results demonstrate that the estimate of a, corresponding to Tov E" negative, and r,, does not have a lower 

measurement error variance than r e 
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forecasts).?? Second, this evidence provides researchers with guidance regarding the relia- 
bility of the expected return proxies in different empirical settings. For instance, researchers 
should be particularly careful when interpreting evidence about the cross-sectional deter- 
minants of the expected return proxies when this evidence is based on a sample that pri- 
marily consists of high /tg, firms. Finally, we show that for observations in the bottom third 
of the distribution of /tg, r., contains very little measurement error, which suggests this 
proxy is a reliable measure of expected returns for a nontrivial subset of our sample.?? 


Instrumental Variables 


We implement the instrumental variables procedure in the following manner. First, each 
of the expected return proxies is regressed on instruments that are for the purposes of these 
analyses assumed to be correlated with true expected return but uncorrelated with the 
measurement error. Next, the fitted values from the instrumental variables regression are 
evaluated in the same manner as the underlying expected return proxies. 

We select the following instruments: CAPM beta, market capitalization, the ratio of 
equity book value to equity market value, the standard deviation of past returns, and industry 
type. Before discussing the empirical results pertaining to the instrumental variables anal- 
ysis, we briefly motivate our choice of instruments. 

Well-known theoretical results developed by Sharpe (19645, Lintner (1965), and Mossin 
(1966) suggest that beta is an appropriate measure of systematic risk. Our firm-year esti- 
mates of beta, BETA,, are derived from regressions of monthly returns on contemporaneous 
excess returns (i.e., market return less the yield on a one-month treasury bill) where the 
data are drawn from the 60-month period prior to the earnings forecast date. Malkiel and 
Xu (1997) focuses on total risk as measured by the variance of returns. We use the standard 
deviation of the daily returns occurring during the year leading up to the forecast date, SD,,, 
as our measure of total risk. A variety of studies (e.g., Banz 1981; Fama and French 1992) 
document a negative association between size and realized returns suggesting that market 
capitalization as of the earnings forecast date, SZZE,, is a potential proxy for risk. Consistent 
with Fama and French (1992) and Berk et al. (1999), we use the ratio of the book value 
of common equity to the market value of common equity at the earnings forecast date, 
BM,, as another measure of risk. Finally, a number of papers (e.g., Fama and French 1997; 
Gebhardt et al. 2001) show that the equity premium differs across industries. Similar to 
Gebhardt et al. (2001) we use the average of the firm-specific estimates of expected return, 
IND,, as our proxy for industry type Cdustry classifications are the same as those in Fama 
and French [1997]). 

The results of estimating regressions of each of the expected return proxies on the four 
risk proxies (BETA,, SD,,, SIZE,, and BM, are shown on the left-hand side of Table 6. For 
all of the expected return proxies other than r,, and r,, the estimates of the coefficients on 
BETA, and SD,, are positive (the coefficient on BETA, is not statistically significant in the 
T, "-, aNd r,,, regressions). Contrary to expectations, the estimates of the coefficient on SIZE, 
are not significantly less than zero (at the 0.05 level) for all proxies other than r,, (—0.000 
with a t-statistic of —3.77). À possible explanation for these results is that the I/B/E/S 
sample on which our analyses are based tends to include larger firms. Finally, all of the 
proxies except r„ have a positive and statistically significant association with BM,,. 


32 For example, Guay et al. (2003) provide evidence consistent with the notion that measurement error in accounting 
based estimates of the expected rate of return is partially attributable to sluggishness in analysts updating of 
earnings forecasts. 

33 Inferences based on the results obtained from the use of instrumental variables, grouping, /ftg, partitions, and 
partitions based on analysts' forecast errors are not sensitive to the robustness checks discussed in Section IV. 
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It is evident from the results in the right-hand side of Table 6 that industry type captures 
much of the variation in the expected returns proxies. For example, the estimate of the 
coefficient on IND, for the regression where the dependent variable is r„ is 0.969 with a 
t-statistic of 46.54. Nonetheless, the coefficients on BETA,, SD,, SIZE,, and BM, are es- 
sentially unchanged with the exception that r,, and r„ are positively associated with SIZE, 
after IND, is included in the regression. 

As shown on Table 7, the fitted values taken from the instrumental variables regressions 
have lower measurement error variances (i.e., 8,) than the raw proxies. On the other hand, 
only r,, has a fitted value with a lower measurement error variance than the fitted value of 
Tą and untabulated results demonstrate that this difference is only statistically significant 
when the fitted values are taken from regressions that include IND, (t-statistic of 3.55). 
Moreover, none of the estimates of a, shown on Table 7 are statistically greater than zero. 
Hence, even after attempting to purge the proxies of their measurement error via instru- 
mental variables, we continue to find little evidence supporting an argument that the proxies 
we evaluate are reliable measures of expected returns. 

Several studies (e.g., Botosan and Plumlee 2005; Gebhardt et al. 2001; Gode and Mo- 
hanram 2003) use coefficients from regressions of proxies for expected returns on various 
risk factors (or correlations between proxies for expected returns and risk factors) as means 
of ranking expected returns proxies. It is evident from the coefficient estimates on the right- 
hand side of Table 6 that the expected return proxy that is best according to the criteria 
adopted by these authors is 7,,,. Specifically, in the r,,, regression, the t-statistic for the 
estimated coefficients on BETA, and SD,, (5.63 and 13.03, respectively) are higher than the 
corresponding t-statistics in the other regressions, the estimate of the coefficient on SIZE, 
is negative (t-statistic of —2.39), and the coefficients on BM, and IND, are highly signifi- 
cant. Nonetheless, the evidence in Table 7 demonstrates that r,., is one of the worst proxies 
as it has the highest 8,“ and one of the lowest estimates of o,. In other words, ranking 
expected returns on the basis of correlations with potential risk factors may lead to erro- 
neous conclusions. 


Grouping 

The instrumental variables procedure combines variables that are, under the maintained 
hypothesis, expected to be correlated with expected returns. Grouping, on the other hand, 
focuses on each variable separately. We group observations into annual portfolios of 10 
based on variables designed to minimize the within portfolio variation in the construct of 
interest (e.g., beta) and maximize the variation across portfolios.* All analyses that are 
conducted at the firm-specific level are repeated using portfolio averages (i.e., average 
realized return, average expected return, average cash flow news, and average return news). 

The results of our portfolio level analyses shown in Table 8 lead to inferences that are 
similar to those based on the results shown in Tables 4, 5, and 7. In particular, none of the 
estimates of a, taken from Equation (2) is statistically greater than zero. Moreover, the 
rankings of 5,” pertaining to the portfolio averages of the expected return proxies are, by 
and large, similar to the rankings of 8,1 pertaining to the firm-specific estimates. There is 
one exception, however: the simplest expected return proxy, 7,,, does not have the lowest 


* Analyses based on portfolios of size 20 lead to similar inferences. Nonetheless, it is possible that the use of 
even larger portfolios leads to a significant reduction in measurement error. Unfortunately, we do not have . 
sufficient degrees of freedom to reliably evaluate this assertion; hence, we cannot make definitive statements 
about the reliability of the proxies at the portfolio level. Purthermore, we cannot assess the accuracy of estimates 
of the expected rate of return that are specifically designed for large portfalios of observations (such as, Easton 
et al. 2002; O'Hanlon and Steele 2000). 
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TABLE 7 
Multivariate Analyses for Annual Portfolios Using Fitted Estimates of Expected Returns 
from Instrumental Variables (IV) Regressions 
Regressions: 
Furi = Qori T 04:4 X Char + Ga X Chuti T+ Ogi X mu + Es 
Vg) = oeri T Ofer X Ofer + San. X Baar + Baer X Bear T Pann 


a oj 
IV Regression IV Regression IV Regression IV Regression 

Model One Two One Two 

pe —1.40 —0.69 0.0001 0.0003 
(—1.63) (—1.16) (0.79) (1.11) 

peg —3.49 SLT 0.0005 0.0006 
(—3.23) (—2.58) (3.59) (3.87) 

mpeg —2.89 —1.38 0.0002 0.0003 
(—1.54) (—1.53) (2.28) (2.21) 

gm —7.36 —1.79 0.0002 0.0003 
(—3.19) (—2.09) (3.02) (2.98) 

Aagr —4.69 —2.76 0.0003 0.0004 
(—2.09) (—2.62) (3.20) (3.34) 

ct 0.00 0.84 —0.0001 —0.0001 
(0.00) (1.13) (—1.28) (—0.40) 

gls =2.31 —1.19 0.0003 0.0005 
(—3.22) (-2.52) (3.87) (2.83) 


Separate regressions are estimated for each of the 18 annual cross-sections of data. Parameter estimates equal the 
average of the annual regression coefficients. t-statistics (in parentheses) equal the ratio of the parameter 
estimates to their temporal standard errors. R? is the mean R? from the annual regressions. 

Ucu+1 = Tusi T Cass — Aas, — rfi a, Where ef,,, lyst, usi and r,,; represent the predicted expected 
return proxy taken from the following instrumental variables regressions: 


ef, ,, = 8, + 0,, X BETA, + 065, X SD, + 04 X SIZE, t 0, X BM, + y, (IV Regression One) 
Chui; = B, + 0,, X BETA, + 02, X SD, + 03, X SIZE, + 0, X BM, + 0, X IND, + vy, 
(IV Regression Two) 


In the above regressions BETA is the capital asset pricing model beta estimated over the 60 months prior to the 
end of year t, SD is the standard deviation of daily returns in year t, SIZE is the market capitalization at the end 
of year t, BM is the ratio of the book value of common equity to the market value of common equity at the end 
of year t, and IND is the mean at the end of year t of the expected return proxies across firms in the same 
industry group. 

The cash flow news proxy, the return news proxy, and the realized, continnously compounded return for year 
t+1. All the expected return proxies represent continuously compounded returns. 

See Table 1 for further details. 


Vei = vau 7 (OCP gers cfi ui) + TCP yet, Mad} X Stee 


_ ZE — Bross _ ZE — Bae _ ZE — Booi 
iue = O (£541) Fiesi Crt) ' and faa T(Ezn+1) 


(continued on next page) 
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TABLE 7 (continued) 


OCP nays Aas) (o(ef,,., rfi) is the covariance between the expected return proxy and the cash flow news 
(return news) proxy. The 8, e, and o7(e) terms shown above are taken from the following first-stage regressions: 


ef, , = B, t Bisi X Chaar + Bia X uyi Eu 
Chri = Bog + Barer X ef, + Bearer X rua + Eua 
rñ; = Bx + Pana X ef, + Borer X Cheri + Eye 


The regression coefficients relating to the measurement error variance of ef,,, are: 


557 Sus) — (oles ys CMs) + Ollo mauu) — {ro cu) + Osso mu) 


9?(v,4,.,) is the measurement error variance of the expected return proxy, o(er,,,, cn.) (OCT). 7n,,4)) is 

the covariance between true expected returns and true cash flow news (return news), and o(v,,,,, Chay) (Oin 
7n,41)) is the covariance between the measurement error in the expected return proxy and true cash flow news 
(return news). 


measurement error variance (i.e., 9,7) when portfolios are formed on the basis of the book- 
to-market ratio. 


Long-Term Growth Partitions 


First, for each annual cross-section of data we partition observations into three groups: 
(1) bottom third of the distribution of lg, (low /tg;), (2) middle third of the distribution of 
ltg,, (medium Itg;, and (3) top third of the distribution of /tg, (high itg). Second, we 
calculate descriptive statistics for each of the itg, partitions for Itg, and the absolute value 
of the difference between firm i's actual return on equity in year r+2 and the forecast as 
of year t of firm i's year t+2 return on equity (|FE ROEJ). As shown in Panel A of Table 
9, |FE. ROE, is increasing in /tg,. For example, the mean of |FE .ROE,| for observations in 
the low itg, is 5 (7) percent lower than the mean of |FE_ROE,| for observations in the 
medium (high) itg, partition, and the untabulated t-statistic for this difference is —6.26 
(—8.95).^ These results are consistent with those shown in Frankel and Lee (1998) and La 
Porta (1996), and imply that tg, is a reasonable ex ante proxy for ex post forecast errors. 

Second, for each proxy we estimate a, and 5,” via Fama and MacBeth (1973) regres- 
sions for each of the three /tg, partitions. Taken together with the descriptive statistics shown 
on Panel A of Table 9, the regression results shown on Panel B of Table 9, imply that there 
is a positive association between the reliability of the expected return proxies and the quality 
of analysts’ earnings forecasts. In particular, as lg, increases, x, declines and 8,” increases 
for four of the seven proxies (r,, Tmpeg Taagr and Ta). Moreover, estimates of a, for firm- 
years with low /tg, are positive for all of the proxies except r,,,. It is also pertinent that, for 
the low itg, partition, the estimate of a, corresponding to r,, is statistically greater than zero 
(t-statistic of 3.04) and r,, has a statistically lower measurement error variance (i.e., 8,4) 


35 The mean of |FE.. ROE| for observations in the medium Kg, partition is 2 percent lower than the mean of |FE. 
ROE} for observations in the high itg, partition, and the untabulated t-statistic for this difference is —3.18. 
Moreover, as shown in Panel A of Table 9, [FE .ROE|| is monotonically increasing in itg, for all percentiles of 
IFE .ROE,, and all of the t-statistics (untabulated) associated with these differences are greater than 3 in absolute 
value with the exception of the t-statistic pertaining to the difference between the 95th percentile of |FE ROE] 
for firms in the medium ig, partition and the 95th percentile of |FE.ROE| for firms in the high tg, partition. 
The t-statistics equal the mean of the annual differences divided by their standard error. Finally, the untabulated 
Pearson (Spearman) correlation between /tg, and |FE .ROE| is 0.13 (0.25) with a t-statistic of 6.41 (17.46). 
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than r,, (untabulated t-statistic of 2.29). Hence, for a nontrivial subset of our sample, r,, 
appears to be a reliable proxy for expected returns. 

Finally, in order to provide further insights regarding the association between analysts' 
forecast errors and the reliability of the expected return proxies, we partition the data into 
thirds on the basis of |FE .ROE,| and estimate a, and 6,” for each partition. The results of 
this analysis, which are shown on Panel C of Table 9, support the notion that there is a 
strong, positive association between the quality of analysts’ forecasts and the reliability of 
the proxies. In particular, as |FE_ROE| increases, a, declines and 5," increases. Moreover, 
for firm-years with low |FE .ROE, estimates of œ, are positive for all of the proxies (the 
estimates for r, Tem and r,, are statistically significant), and estimates of 8," are not 


statistically different from zero. While |FE .ROE| is unobservable at time z and, thus, cannot 


TABLE 9 
Associations between the Long-Term Earnings Growth Rate (itg), Absolute Forecast Errors 
(IFE. ROE,;), and Reliability of Expected Return Proxies 


Panel A: Descriptive Statistics of itg, and |FE_ROE,| by itg, Partitions 


Mean Std Sth 25th 50th 75th 95th 
itg, 
Low itg, 0.085 0.024 0.037 0.073 0.093 0.100 0.120 
Medium itg, 0.138 0.021 0.110 0.120 0.135 0.150 0.180 
High itg, 0.241 0.086 0.150 0.180 0.215 0.280 0.400 
[FE ROE] 
Low itg, 0.103 0.259 0.003 0.016 0.043 0.106 0.328 
Medium Itg, 0.153 0.388 0.006 0.027 0.063 0.142 0.487 
High /tg, 0.173 0.378 0.010 0.041 0.089 0.179 0.483 
Panel B: Regression Analyses by lg, Partitions 
Fari = Gg, T Qira X Chingy T 00,4 X Cusi + 03,4 X Aya + Enyi 
Vener = Borer + Ota X Efer F Su X Cheer T Saar X Baar T Mun 
a, oj 
Model Low lig, Medium /tg, High lig, Low itg, Medium lig, High /fg, 
pe 0.27 —0.50 — 1.82 0.0004 0.0008 0.0011 
(0.65) (—1.74) (—4.12) (1.17) (2.95) (2.84) 
peg 0.16 —0.25 — 1.34 0.0009 0.0008 0.0010 
(0.48) (—0.67) (—3.40) (3.22) (4.13) (4.93) 
mpeg 0.35 —0.24 — 1.20 0.0006 0.0008 0.0009 
(0.99) (—0.62) (—2.89) (2.28) (3.94) (4.19) 
gm 0.24 0.02 —0.94 0.0009 0.0009 0.0009 
(0.81) (0.04) (—2.26) (3.18) (4.17) (4.48) 
Aagr 0.20 —0.46 — 1,44 0.0006 0.0007 0.0008 
(0.61) (—1.34) (—3.49) (2.38) (5.13) (5.16) 
ct 1.31 0.44 —0.85 0.0001 0.0002 0.0005 
(3.04) (0.97) (— 1.40) (0.27) (0.73) (1.64) 
gls —0.79 —0.97 —2.43 0.0008 0.0004 0.0000 
(—2.16) (—4.30) (—4.66) (2.30) (2.00) (0.02) 


(continued on next page) 
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TABLE 9 (continued) 


Panel C: Regression Analyses by |FE..ROE,| Partitions 


Tipe = Opens T Opes X CPress T Quei X cñ,,, +f Ogee, X uu T Enpi 
Verri = Soren + Star X Einsi + Š, X Eja T Sua X Chaar T Wari 
x, or 

Low Medium High Low Medium High 
Model IFE. ROE| IFE .ROE, IFE ROE, IFE ROE, IFE —ROE| |IFE. ROE, 
pe 0.41 —0.22 —1.19 —0.0002 0.0000 0.0015 

(1.41) (—0.63) (—2.30) (—0.47) (0.06) (2.92) 
peg 0.57 —0.36 —1.18 0.0000 0.0008 0.0024 

(1.34) (—1.14) (—3.19) (0.00) (3.13) (6.69) 
mpeg 0.98 —0.05 —0.98 —0.0003 0.0004 0.0021 

(2.96) (—0.13) (—2.54) (— 1.53) (1.62) (6.17) 
gm 0.76 —0.19 —0.93 —0.0001 0.0006 0.0023 

(2.32) (—0.68) (—2.66) —0.75) (2.93) (6.98) 
Aagr 0.60 —0.38 —1.18 —0.0001 0.0005 0.0020 

(1.39) (—1.09) (—2.69) (—0.34) (2.48) (5.57) 
ct 1.25 1.01 —0.43 —0.0003 —0.0003 0.0007 

(4.06) (2.52) (—0.76) (—1.39) (—1.50) (2.28) 
gls 0.22 —0.93 —1.72 0.0000 0.0004 0.0003 

(0.79) (—2.88) (—3.59) (0.07) (1.61) (1.10) 


ltg, (IFE_ROE.) partitions are formed each year by sorting the data into thirds on the basis of the consensus 
analyst forecast of the long-term growth in earnings, ltg, (the absolute value of the difference between actual 
ROE, and the forecast as of year t of ROE,,., FEROE D. Low tg, (FE. ROEj), medium itg, (FE_ROE}), 
and high itg, (IFE..ROE]) denote observations with values of lrg, (FEROE, J) in the bottom, middle, and top 
third of the distribution, respectively. Mean, Std, 5th, 25th, 50th, 75th, and 95th represent the mean, standard 
deviation, 5th percentile, 25th percentile, median, 75th percentile, and 95th percentile, respectively. Separate 
annual cross-sectional regressions are estimated for each partition. Parameter estimates equal the average of the 
annual regression coefficients. t-statistics (in parentheses) equal the ratio of the parameter estimates to their 
temporal standard errors. R? is the mean R? from the annual regressions. 


Ucuei — lire. 7 


fas = 


— Tr where e eit, Cypis ngi and ry,, represent the expected return proxy, 


the cash flow news proxy, the return news proxy and realized, continuously compounded return for the year t+1. 
All the expected return proxies represent continuously compounded returns. 
See Table 1 for further details. 


UG+1 = Vener, — [o(ef,,+i cu) + o(ef,. o fuu) X ea 


=f — B eae 7C +1 —Ëas+1 7 Borer 
cA, = DEus Piet = Pane T Boge ong = Ifa — Pau 
cis THUE) Eur 0I (£41) Kas 02(€sy+1) 


o(ef,,, Anas) (Oleari rñ,, )) is the covariance between the expected return proxy and the cash flow news 
(return news) proxy. The B, £, and o2(g) terms shown above are taken from the following first-stage regressions: 


€f. = Bisa + Biter X Aysr + Bias X rua + E 
Chari = Bog + Baia X €f, + Borna X rua T £y 
Thusi = Baoesr + Barer X ef, + Barrer X Cua + ES, 


The regression coefficients relating to the measurement error variance of ef,,, are 


oie = O (v.i) — {OCT gay Mya) + OCC yey) Mya} — (OU ways cu) t CU ger Mas) 


o7(U,,+,) is the measurement error variance of the expected return proxy, o(er,,,, cn,,44) (o(eruí,, Mysi) is the 
covariance between true expected returns and true cash flow news (return news), and o(v,4,,, Chus) (OUr 
™,,,1)) is the covariance between the measurement error in the expected return proxy and true cash flow news 
(return news). 


The Áccounting Review, April 2005 


An Evaluation of Accounting-Based Measures of Expected Returns 531 


be used on an ex ante basis, these results remain pertinent as they imply that the devel- 
opment of better approaches for forecasting earnings or statistical models of analysts' fore- 
cast errors are fruitful research opportunities. 


VI. SUMMARY AND CONCLUSIONS 

We develop an empirical approach that allows us to evaluate the reliability of an ex- 
pected return proxy from its association with realized returns even if realized returns are 
biased and noisy measures of expected returns. Our results suggest that, for the entire cross- 
section of firms, the seven accounting-based proxies we consider are not reliable measures 
of expected returns. None of the proxies has a positive association with realized returns 
even after controlling for the bias and noise in realized returns attributable to contempo- 
raneous information surprises. Moreover, the simplest model, which is based on the least 
reasonable assumptions ex ante, vields an estimate of expected returns that contains no 
more measurement error than the remaining proxies. These results are robust as they remain 
descriptive even after we attempt to purge the proxies of their measurement error via the 
use of instrumental variables and grouping. 

Further analyses, however, demonstrate that certain proxies are reliable for nontrivial 
subsets of the data. First, we show that the reliability of the proxies is decreasing in the 
magnitude of consensus analysts’ long-term earnings growth rate, and that r,, (the proxy 
inspired by Claus and Thomas [2001]) is a reliable proxy for firms with low consensus 
long-term growth forecasts. We also demonstrate that when ex post analysts' forecasts errors 
are low, all of the proxies have a positive association with expected returns. Combining 
these two sets of results with the fact that ex post forecast errors are increasing in analysts’ 
long-term growth forecasts leads us to conclude that the consensus long-term earnings 
growth rate is a useful ex ante indicator of reliability. 

This study has three implications for future research. First, the approach we describe 
can be extended and used in other contexts. For example, development of alternative proxies 
for cash flow news and return news will improve researchers' ability to draw direct insights 
about the association between an expected return proxy and true expected returns. This, in 
turn, will ameliorate the need for the more complex analyses of measurement error vari- 
ances. In addition, the identity of the model of market equilibrium that best characterizes 
the risk-return trade-off is of significant interest to many, and our approach provides a 
means of evaluating models of expected returns suggested by asset-pricing theory. 

Second, given the general lack of reliability of the proxies we evaluate, the extant 
evidence in the accounting and finance literatures based on these proxies should be inter- 
preted with caution. In addition, further research on approaches for imputing reliable esti- 
mates of expected returns from contemporaneous prices appears warranted. Finally, by 
demonstrating a link between the apparent lack of reliability of the proxies we evaluate and 
the quality of analysts' forecasts, we provide direction for future research— development 
of better approaches for forecasting earnings or statistical models of analysts’ forecast errors 
appear fruitful. 


APPENDIX A 
Linear Decomposition of Realized Returns and Estimation of p 
In this appendix we describe Vuolteenaho's (2002) linear return decomposition, which 
serves as the foundation for our empirical tests. We also describe the statistical procedure 
we use to estimate p. 
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Vuolteenaho's (2002) Linear Return Decomposition 
We begin by noting the following identities: 


+ 
1 = (1 + ROE,)! X — (A.1) 
it 
+ 
1 = (1 + RS x k (A.2) 
it 


In Equations (A.1) and (A.2) ROE,,,, equals E,,,/B,, E,,; is earnings at time t+1, B, is 
equity book value at the end of period t, D,,,, is dividends paid during period t+1, R,,,, 
is stock return for period t+1 and M, is equity market value at the end of period t. We 
assume clean surplus accounting; hence, Equation (A.1) is an identity (ie., B,,, + Das 
x B, T Ens). 

Dividing each side of Equations (A.1) and (A.2) by D,, rearranging, and taking logs 
we obtain the following expressions for the ratio of equity book value to dividends and the 
price to dividend ratio: 


(b, — d,) = In (B,) — In (D,) 


= In (2) = In (erizdan + 1) + Ad, — roe,,,, (A.3) 
ü 
(m, — dj) = In (Mj) — In (D) 
- In (Me) = In (e«t + 1) + Ad, — rua. (A.4) 
it 


In Equations (A.3) and (A.4) lower case letters denote natural logs, Ad,,, is the natural 
log of dividend growth at time t--1 (ie. d,,, — d, = In (D,,,/D,)), roe,,, denotes the 
natural log of 1 plus ROE,,,, and r,,, denotes realized, continuously compounded returns 
for time t--1. Equation (A.4) illustrates that the price to dividend ratio is increasing in 
future dividend growth but decreasing in the future discount rate. An analogous interpre- 
tation holds for Equation (A.3). 

While Equations (A.3) and (A.4) are identities, they are also nonlinear (i.e., they are 
both functions of logged expressions). Hence, they cannot be directly converted into a linear 
expression for the natural log of the book-to-market ratio, which is necessary for the de- 
velopment of linear decomposition of realized returns. 'To derive a linear expression for the 
natural log of the book-to-market ratio, we approximate Equations (A.3) and (A.4) by tak- 
ing Taylor expansions of In (e+ d+ + 1) and In (em 79 +. ]) about the point 
e XG-7dyt(ü79X0742 (X is a number between 0 and 1 and m — d and b — d denote uncon- 
ditional means). Next, we subtract the approximation of Equation (A.4) from the approxi- 
mation of Equation (A.3). This yields the following linear approximation of the natural log 
of the book-to-market ratio: 


g^ X(b-4 0 —3)X(n-4) 


(b, - mj ^ 1 + ge*e-a*a-0X(n-4d) X (by, — mia) + Fati — T0644 


= p X (Duti — my) + Fusi — r0654, (A.5) 
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In Equation (A.5) p is a number slightly less than 1. To understand p better it is useful 
eo MID M. 
to assume À equals 0, in which case p = —— = 1+M/D +W , which implies 
that p is bounded between 0 and 1, and is increasing in the price-to-dividend ratio. 


Assuming lim p™™! X (byr — mj47) = 0 and iterating Equation (A.5) forward, yields 
T—« 
the following approximate expression for the natural log of the book-to-market ratio: 


(b, — m,) == > p xX it > pe lx r OE n+ (A.6) 


T=! 


Equation (A.6) illustrates that the book-to-market ratio is increasing in future discount 
rates (i.e., prices are low when expected future discount rates are high) and decreasing in 
future accounting returns (i.e., prices are high when expected profitability is high). 

Finally, we obtain a linear approximation of realized returns by evaluating the change 
in expectations of Equation (A.6) from t to t-- 1 and rearranging:?é 


Teri = E [r,, 1] + AE, P px Poe — AE, È px ls 
T"* = 
= CF yp F Chiri — Mir (A-7) 


In Equation (A.7) E[.] is the conditional expectation operator and AE,,,[.] equals E,,,[.] 
L.]. Equation (A.7) illustrates the components of realized returns. In particular, changes 
in expected future roe (i.e., cash flow news, cn,,,,) and changes in expected discount rates 
(i.e., return news, 77,,,) cause realized returns, r,,,, to differ from expected returns, 
er,,,. Hence, in order to draw meaningful inferences about the reliability of a particular 
measure of expected returns, we must control for cash flow news and return news. 


Estimation of p 

We estimate p for five portfolios—four equal-size portfolios are formed on the mag- 
nitude of the price-to-dividend ratio and a fifth portfolio includes all non-dividend-paying 
stocks. Because p is a function of the unconditional (i.e., long-run) price-to-dividend ratio, 
we set it equal to the median of the annual estimates of p, obtained from the following 
pooled, cross-sectional regression equation, which is obtained by rearranging Equation 
(A.5): 


(b, — mi — Viney T r06,,)) = py + p, X (bu, — maj). (A.8) 

In Equation (A.8) + is a number between t and t—9 and the sample includes all firm-years 

in the same price-to-dividend portfolio with requisite data in year t—9 through year t (i.e., 
for each year t we estimate p, using firm years between year t—9 and year 1). 

*° Equation (A.6) implies that: 0 = AE, [b, — m,] = AE, b p"! x mess — AE,., P p"! x "s. 
i Aw 

which equals AE,,, Š px mee = ie — Er. j) + AE, I pl x A This, in turn, implies 

ve! Awa 
Equation (A7). It also implies that cash flow news relates to changes in expectations about all future roe (i-c., 


t+], t+2,...), whereas return news relates only to changes in expectations about discount rates pertaining to 
years occurring after year t+1. 
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The values of p for the different price-to-dividend portfolios are: (1) non-dividend- 
paying stocks p = 0.988, (2) fourth quartile of price-to-dividend ratio for dividend paying 
stocks (ie., the quartile with the highest price-to-dividend ratio): p = 0.957, (3) third 
quartile of price-to-dividend ratio for dividend-paying stocks: p = 0.921, (4) second quartile 
of price to dividend for dividend-paying stocks: p = 0.927, and (5) first quartile of price- 
to-dividend ratio for dividend-paying stocks: p = 0.924. 


APPENDIX B 
Statistical Approach for Estimating Measurement Error Variances 


As discussed in Section II of the main text, the bias in the estimates of a,,,, obtained 
from Equation (2) does not provide clear evidence about the relative measurement error in 
a particular expected return proxy. To understand better the bias in a,,,, it is helpful to 
consider the following re-expression of Equation (1) from the main text: 

Figg ™ Crusi T+ Chipi T Mas 
= (ef, — Viger) + (Chari — uuu) + Misi 7 Vrsi) (B.1) 
CP iggy + Cf usc Mga T (— vua 7 Ua 7 Uy) 


= Past T Veit 


In Equation (B.1) v,,,,, v,,,, and v,,, denote the measurement error in ef,,,, cfi, ,,, and 
rfi,,,, respectively. Recall, rfi,,,.,, which we refer to as the return news proxy, is the negative 
of return news (1.e., —71,,,». We assume the measurement error in a particular proxy is 
uncorrelated with the true underlying construct, but may be correlated with the true values 
of the other constructs and the measurement errors in the remaining proxies. While we are 
primarily interested in vj,,,, it is unobservable. However, we can use the combined mea- 
surement error, v,,,4,, to evaluate the extent to which v,,,, contributes to the bias in a,,,. 
Results presented in Rao (1973), Garber and Klepper (1980), and Barth (1991) demonstrate 
that v4,,, and the bias in o,,,, are related in the following manner: 


Vent = Moe, — Bias (0,4) X ef, I — Bias (05,,,) X cñ, — Bias (04,4) 


X rfi, T+ £g 
reet (ats - Pascua - E xa 
(zen Pun - ascen xn 


In Equation (B.2) the B and o? (g) are the slope coefficients and residual variances obtained 
from the set of regressions shown in Equation (B.3): 


ef, = Bratt + Bii X Chari + Biasi X ary T Eu 
fis, = Borsi + Bauer X Cfi + Bos X rua + Eug (B.3) 
Thari = Booi T+ Psi X éef,, t Bozi X Cusi + 84,4. 


The Accounting Review, April 2005 


An Evaluation of Accounting-Based Measures of Expected Returns 535 


Mirti To and 04,4 are functions of the variances of the measurement errors in éf,,,,, 
Cfi, ,, and rfi,,,, respectively. Hence, we refer to the n terms as “noise variables." The 
relation between the noise variables and the covariance structure of the measurement errors 
is: 


Nie = (va) + {00r Ved T Cur Vsi42)) 
+ [OCCT pats Voie) + OCs 1) Vairr i) } 

nan = 9 (6,4) + [OW ur Vier + CMe User} (B.4) 
+ [o(cn,, Vinti) + O(Chn,, 1 v3] 

Nai = 025644) + (ouis Dra) T ous Vries) 
+ [o(rn,,o bua) t omues Wari) 


In Equation (B.4) 07(,,,.;), 0?(u,,,,), and o?(,,,,) denote the variance of the measurement 
error in ef,,,, cf1,,,, and 7fi,,,, respectively. The terms in the first set of braces correspond 
to covariances between the measurement errors in the proxies (e.g., o(vj,,,, v4,4) is the 
covariance between the measurement error in ef,,, and the measurement error in cf, ,). 
The terms in the second set of braces after the equal sign correspond to covariances between 
true, unobservable constructs and measurement errors in the proxies (e.g., o(er,,,, Voz4 1) is 
the covariance between true expected return and the measurement error in cfi, ,). 

Equations (B.2) through (B.4) demonstrate that the difference between one and a,,,, 
is a complex function of the covariance structure of the independent variables and the 
covariance structure of their measurement errors. Hence, this difference is not solely at- 
tributable to measurement error in the expected return proxy (i.e., v,,,,). However, we can 
use Equations (B.2) through (B.4) to infer the portion of the bias in a,,,, that is attributable 
to v4,,. We follow the two-stage process described in Barth (1991). 

In the first stage we estimate the regressions shown in Equation (B.3). Next, we com- 
bine the regression coefficients, residuals, and residual variances from these regressions to 
develop a set of constructs that are used as the regressors in the following second stage 
regression, which is obtained by rearranging Equation (B.2): 


£ — B | 
Vener = Tei + (7114.1) X ( Eco e| 3E (Nz) 
l#+1 


Wri — Porti = Ezur 7 "sru ) 
* | TON TENET ( ZONE 


+ Bani 
= Borer + Orei X Eel + Òai X £241 + Š, X El F Meyi 


Note each of the regressors in Equation (B.5) is a function of a particular error term from 
one of the regressions shown in Equation (B.3). Hence, we refer to these regressors as 
adjusted errors (e.g., the regressor that is a function of £,,,, is referred to as e$, ,). 
Ranking on the basis of ò., embeds the assumption that {00m wv, 
t Oyj i> v44,4)) and (o(er,,,, tuti) + OCCT zt 1. v4444)) are constant across expected return 
proxies, which may not be descriptive. To circumvent this problem we refine the regression 
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shown in Equation (B.5) by replacing the regressand (i.e., v,,,,) with vZ,,,, which is defined 
as: 


VOI = Uccisi T {Tle iri Chur) + o(efuus rfi) X En (B.6) 


In Equation (B.5) o(ef,,,,, cn,,,) denotes the covariance between the expected return proxy 
of interest and the cash flow news proxy, and o(ef,,,,, rfi,,,) is the covariance between the 
expected return proxy and the return news proxy. The coefficient on e%,,, obtained via 
estimation of Equation (B.6) is?" 


Otte = Nyi 7 {P(C frt Ais) + Oefe fu) 
Orne) — [O(er,+, cna) + oO(er,,, mua) 


~ {TVu Mie) + CU gst 5,4)). (B.7) 
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ABSTRACT: In this study, | examine the association between the credibility of the 
flnancial reporting system and the quality of governance mechanisms. | use a sample 
of 87 firms identified by the SEC as fraudulently manipulating their financial statements. 
Consistent with prior research, results indicate that fraud firms have poor governance 
relative to a control sample In the year prior to fraud detection. Specifically, fraud firms. 
have fewer numbers and percentages of outside board members, fewer audit com- 
mittee meetings, fewer financial experts on the audit committee, a smaller percentage 
of Big 4 auditing firms, and a higher percentage of CEOs who are also chairmen of the 
board of directors. However, the results indicate that fraud firms take actions to improve 
thelr governance, and three years after fraud detection these firms have governance 
characteristics similar to the control firms in terms of the numbers and percentages of 
outside members on the board, but exceed the control firms in the number of audit 
committee meetings. | also investigate whether the improved governance Influences 
informed capital market participants. The results indicate that analyst following and 
institutional holdings do not Increase in fraud firms, suggesting that credibility was still 
a problem for these firms. However, the results also Indicate that firms that take actions 
to improve governance have superior stock price performance, even after controlling 
for earnings performance. This suggests that investors appear to value governance 
improvements. 
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L INTRODUCTION 

mpirical evidence indicates that weak corporate governance is associated with finan- 

cial reporting fraud (e.g., Dechow et al. 1996 [hereafter DSS]; Beasley 1996), but 

little is known about the actions that fraud firms take to improve their weak gover- 
nance after fraud detection and, perhaps more importantly, how effectively these actions 
restore investor trust. Given the importance of the relation between the quality of gover- 
nance mechanisms and the credibility of the financial reporting system, it is surprising that 
we know so little about the nature and extent of this relation. In this study, I seek to fill 
this gap by investigating the association between the revelation of financial reporting fraud 
and subsequent improvements in the quality of corporate governance mechanisms and the 
corresponding economic consequences of such improvements. This evidence is important 
because the quality of corporate governance is at the forefront of concerns of regulators 
and corporate management. 

Regulators, perhaps in response to the recent flurry of highly publicized financial re- 
porting frauds (e.g., Enron), are considering or have enacted rules intended to strengthen 
the quality of corporate governance.! Much of the debate surrounding these proposals re- 
volves around the idea that stronger governance is associated with more credible financial 
reporting. Some firms accused of fraud are using a recovery strategy that includes gover- 
nance improvements. For example, in a recent financial press article (Sorkin 2002), Edward 
D. Breen, who was appointed CEO of Tyco following allegations of fraudulent financial 
reporting there, commented on the state of the quality of corporate governance at Tyco: 


The most important thing, in my opinion, when I came into the company, and I've 
stated this many times, was fixing corporate governance ... (emphasis added) 


Breen appears to believe that there is a net benefit to improving his firm's governance 
mechanisms, perhaps in the form of a positive stock price reaction. 

It seems obvious that firms caught committing fraud would take some actions with 
respect to their governance mechanisms, the most likely of which is to discharge the culprits 
associated with the fraud. However, it is not as obvious that these firms would necessarily 
improve their governance after fraud detection because they could merely replace inside 
directors with other insiders, rather than with outside directors, for example. 

My first main research objective, therefore, is to examine whether there is an association 
between the detection of financial reporting fraud and subsequent improvements in the 
quality of governance mechanisms. I find that while fraud firms' governance structures are 
initially weak, by the end of a three-year period following the year of fraud detection these 
firms have a similar board of director profile and proportion of firms with the combined 
CEO/COB position to that of their matched control firms. I also find that fraud firms hold 
more audit committee meetings than their matched control firms by the end of this same 
period. Overall, these results suggest that improving the quality of governance is important 
to fraud firms. 

Given that regulators are mandating new governance rules and that fraud firms are 
likely to be expending scarce resources on governance improvements after fraud detection, 
it is important to document whether these improvements provide any economic benefits. 
Evidence that governance improvements do indeed provide economic benefits would sup- 
port the basis for these rules and firms' expenditures on enhanced governance. However, it 
could very well be the case that these improvements do not provide any economic benefits 


' For example, NYSE (2004) and NASDAQ (2004) have both recently enacted rules requiring that their member 
firms have boards of directors with a majority of independent directors. 
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and are merely window dressing (1.e., non-value-enhancing) and thus an inefficient use of 
resources. Providing evidence on either case adds to our knowledge of the economics of 
improving the quality of the corporate governance mechanisms that monitor the financial 
reporting process. 

My second main research objective, therefore, is to examine whether these improve- 
ments aid fraud firms in restoring their reputations with informed capital market participants 
and recovering share values. Specifically, I examine the relation between improvements that 
fraud firms make in their boards and audit committees following fraud detection and the 
corresponding responses of analysts, institutional investors, short sellers, and the stock 
market. Results indicate that analyst following and institutional ownership do not increase 
in fraud firms, suggesting that credibility is likely still a problem for these firms. These 
results are not surprising given that fraud firms experience negative returns following fraud 
detection. More interestingly, I find a positive and economically significant relation between 
increases in board independence and long-run buy-and-hold abnormal returns over the 
three-year period following fraud detection, even after controlling for earnings performance. 
Overall, my findings suggest that fraud firms and, perhaps more importantly, the market, 
view improving the quality of governance mechanisms as a way of restoring trust after 
fraud. 

The remainder of this paper is organized as follows. In Section I, I provide a brief 
review of the relevant literature on governance mechanisms and fraud. Section III contains 
a description of the sample. Section IV contains the research design and empirical results. 
I summarize and conclude in Section V. 


II. LITERATURE REVIEW 

One stream of the accounting literature related to financial reporting fraud examines 
the association of governance mechanisms and the likelihood of fraud (e.g., DSS; Beasley 
1996). DSS find that firms committing financial reporting fraud are more likely to have a 
board of directors dominated by insiders and are less likely to have an audit committee. 
However, they do not find that the use of a (then) Big 6 audit firm significantly influences 
the likelihood of fraud. Beasley (1996) finds that the inclusion of larger proportions of 
outside members on the board of directors significantly reduces the likelihood of financial 
reporting fraud, but does not find significant evidence that fraud firms' audit committees 
meet less frequently. In contrast to DSS and Beasley (1996), the current study investigates 
the magnitude and economic consequences of fraud firms' improvements in important mon- 
itoring mechanisms during the three-year period following fraud detection. 

There are few extant studies that examine firm responses to fraud detection. In research 
closely related to the current study, Agrawal et al. (1999) investigate the association of 
fraud detection and subsequent turnover in senior managers and directors, but fail to find 
one. Agrawal et al. (1999) examine 103 firms that commit a broad range of frauds, but 
only four of those firms represent cases of financial reporting fraud. Thus, it is difficult to 
draw inferences from their study regarding the effect of GAAP violations on governance 
improvements. Moreover, past research (Karpoff and Lott 1993) finds that mean abnormal 
returns in a two- or three-day window around fraud detection are significantly more negative 
in cases of financial reporting fraud than in other cases of fraud. Therefore, incentives to 
overhaul governance mechanisms are likely to be greater for cases of financial reporting 
fraud firms than for other cases of fraud. However, this remains an empirical question— 
one that I examine in this study. 

Livingston (1997) and Beneish (1999) investigate senior management turnover follow- 
ing the detection of financial reporting fraud. Srinivasan (2004) investigates outside director 
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turnover after an accounting restatement. Livingston (1997) finds significant turnover in top 
managers and financial officers, while Beneish (1999) finds no unusual turnover in senior 
management. Srinivasan (2004) finds significantly higher turnover in boards for firms that 
restate earnings downward and that the likelihood of director departure increases in restate- 
ment severity. Unlike the Livingston (1997) and Beneish (1999) studies, the current study 
focuses on the mechanisms intended to oversee managerial actions, rather than on man- 
agement itself. This is important because agency costs will continue to exist as long there 
is separation of ownership and control in a firm. Replacing top management following fraud 
detection does not get to the root of the problem. As Jensen (1993) notes, ''... bad systems 
or rules, not bad people, underlie the general failings of the board of directors." Therefore, 
enhancing tbe quality of governance mechanisms is more likely to ensure that managers 
do not deviate from their fiduciary duties. Unlike the Srinivasan (2004) study, the current 
study focuses on changes in a more comprehensive set of governance variables. The cur- 
rent study also examines these changes around a financial reporting fraud, a distinctly 
different event from an earnings restatement, which does not necessarily involve fraud. 
Moreover, the current study also investigates the economic consequences of these gover- 
nance changes for the firms themselves. 

The only extant study of which I am aware that provides direct evidence on the market's 
response to changes in boards of directors is that by Rosenstein and Wyatt (1990), who, 
in a nonfraud setting, find a small positive mean abnormal return over a two-day window 
centered on the announcement dates of outside director appointments. It is possible, how- 
ever, that the market will discount or ignore governance changes made by fraud firms 
because the revelation of fraud has so badly tarnished these ürms' reputations. Therefore, 
our understanding of the economic consequences of governance changes after fraud detec- 
tion is an open empirical issue that warrants investigation. 


III. SAMPLE SELECTION 
Fraud Sample 


The fraud sample consists of publicly held companies cited in United States Securities 
and Exchange Commission (SEC) Accounting and Auditing Enforcement Releases 
(AAERsY for violating SEC Rule 10b-5 during the period 1982-2000. An illustrative ex- 
ample of the SEC's justification for issuing AAERs under the fraud provisions of the 
Exchange Act of 1934 follows (per AAER No. 689, in the case of Kurzweil Applied Intel- 
ligence, Inc.): 


Section 10(b) of the antifraud provisions of the Exchange Act (of 1934) and Rule 10b- 
5 thereunder proscribes the making of materially false and misleading statements ‘‘in 
connection with the purchase or sale of any security." Violations of Section 10(b) and 
Rule 10b-5 occur when an issuer makes material misstatements in registration state- 
ments, prospectuses or periodic reports filed with the Commission and trading thereafter 
occurs in the issuer's securities. The filing of false and misleading reports is also a 
violation of Section 10(b) of the Exchange Act (of 1934) and Rule 10b-5 thereunder 
because reports of publicly traded companies affect the markets for offer, sale and 
purchase of their securities. 


2 The SEC issues AAERs against firms and others (e.g., audit firms, managers, and directors) for violations of 
SEC rules. 
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. I use AAERs as a proxy for the occurrence of fraud. Consistent with Beasley (1996) 
and Bonner et al. (1998), this proxy is intended to capture extreme cases of fraud in the 
error-to-fraud continuum. A focus on AAERs is justified because while the SEC cannot 
possibly investigate every case of fraud, it does target the highest profile cases (Feroz et 
al. 1991). Consequently, the fraüd sample used in this study is likely.to include the most 
egregious cases of fraud and thus provides a powerful setting to examine the impact of 
fraud detection on governance improvements because if no effect is found in this setting, 
then it is not likely to be found in any other. However, the use of AAERS has limitations. 
For example, because the SEC selects cases for which it has the best chance of winning a 
judgment, they are likely to include instances of the most extreme misleading reporting. 
Therefore, the results of this study are not likely to be generalizable to the entire population 
of firms that report fraudulently. 

I restrict my analyses to frauds that were detected through 1997 to allow for a three- 
year period after fraud detection in which to examine governance changes. The analysis 
period is restricted to five years to make data collection more tractable. Table 1, Panel A 
reconciles the number of AAERs issued over my analysis period to my final fraud sample. 
There were 1,357 AAERs issued from 1982 to 2000, of which 1,067 did not involve 
financial statement fraud, were duplicates, or were outside the scope of the analysis period. 
I was unable to locate proxies or financial statement data for another 195 firms. I include 
in the final fraud sample each fraud firm that has either proxy statements or SEC Forms 
10-K available for at least one year before and at least one year after fraud detection. This 
sample selection criteria initially leads to 95 firms in the fraud sample, but because I was 
unable to identify matches for 8 of the fraud firms, the final fraud sample size is 87. Once 
I identify a firm for inclusion in the fraud sample, I search the Dow Jones database for the 
period surrounding the fraud period identified in the AAER to determine the fraud detection 
date. Following Feroz et al. (1991), if no indication of the fraud detection date appears in 
Dow Jones, I use the AAER date as a proxy for the detection date. During my review of 
the AAERs, I carefully examined each one to ensure that the fraud firms identified were 
cited for false and misleading financial reports. Moreover, I noted no cases where a firm 
was cited for violating Rule 10b-5 only for reasons other than the issuance of false or 
misleading financial statements. 

Panel B of Table 1 indicates that fraud firms are widely distributed among industries, 
with some clustering of firms in machinery and computer equipment, measurement instru- 
ments, and business services. Panel C of Table 1 indicates that 60 percent of the frauds 
involve fictitious transactions (i.e., those that are unambiguously fraudulent and include 
creating phony invoices for sales that do not exist and never will), that fraud firms overstate 
their net income by a mean (median) of 174 percent (53.70 percent) and that the mean 
(median) length of the fraud is 6.66 (6.00) quarters. 


Control Sample 

Prior research (e.g., DSS) and the current study show significant clustering by industry 
among fraud firms. Therefore, examination of governance changes surrounding fraud de- 
tection is likely to reveal characteristic governance behavior that is associated with changes 
in the industry rather than with fraud detection. To control for this possibility, I use a 


3 Because AAERs are typically issued two to three years subsequent to fraud detection (see, Feroz et al. 1991), 
. I reviewed them through the year 2000 to ensure that I identified all frauds through the year 1997 (AAERs 
contain information regarding the fraud period). 
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matched-pairs design to test changes in corporate governance mechanisms following fraud 


detection. 


For each fraud firm, I select a control firm with the same four-digit SIC code, with net 
sales within +25 percent of the fraud firm's net sales for the year preceding fraud detection, 


TABLE 1 


Fraud Sample Selection, Fraud Firm Characteristics, and Fraud-Specific Statistics 


Panel A: Sample Selection of 87 Firms Subject to Enforcement Actions under SEC Rule 10b-5 


between 1982 and 2000 


Number of AAERs issued between 1982 and 2000 1,357 
Less: 
AAERs not involving financial statement fraud, duplicate AAERs, or AAERs outside 1,067 
scope of analysis period (e.g., unintentional misapplication of GAAP, AAERs against 
auditors, AAERs issued for frauds occurring after 1997) 
Firms without proxy or financial statement data 195 
Firms without an available matching non-fraud firm 8 
Final Sample 87 
Panel B: Distribution of Fraud Firms by SIC Code 
Two-Digit Number 
SIC Code . Industry Description — — of Firms 
13 Oil and Gas Extraction 5 
22 Textile Mill Products 1 
23 Apparel 2 
26 Paper and Allied Products I 
27 Printing and Publishing 2 
28 Chemicals 5 
32 Stone, Clay and Glass 1 
34 Fabricating Metals 5 
35 Machinery and Computer Equipment 10 
36 Electrical 9 
37 Transportation Equipment 1 
38 Measurement Instruments 9 
39 Miscellaneous Manufacturing l 
42 Motor Freight Trans, Warehousing 1 
45 Air Transportation I 
49 Electric, Gas, Sanitary Services 4 
50 Wholesale Durable Goods 5 
52 Building Materials ] 
56 Apparel and Accessory Stores 2 
59 Miscellaneous Retail 4 
61 Nondepository Credit Institution 1 
73 Business Services 12 
79 Amusements and Recreation 1 
80 Health Services 2 
99 Miscellaneous 1 
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TABLE 1 (continued) 


Panel C: Fraud-Specific Statistics 
Type of GAAP Violation? Number of Firms Percentage of Total 
Fictitious Transactions 52 60 
Timing Difference 31 36 
Transactions 
Unidentified 4 E. 
Total 87 100 
Median Mean Std. Deviation Max Min 
Overstatement as 53.70 174.20 361.86 2,169 0 
percentage of net | 
income 
Length of GAAP Violation 6.00 6.66 4.21 19 1 
(in quarters) 


* Fictitious transactions are unambiguously fraudulent and include phony invoices for sales that do not exist and 
never will. Timing difference transactions represent premature recognition of legitimate transactions that 
eventually occur. 


and within the same stock exchange. If I cannot find a match within the same four-digit 
SIC code, then I relax my matching criteria to include firms with the same three-digit SIC 
code, and if necessary, match firms on the basis of a two-digit SIC code. The potential 
control firm is included in the final matched control sample if there are no reports of fraud 
in the Dow Jones Database for that firm in the two years before and three years after the 
fraud detection year of its corresponding fraud firm. Additionally, I search AAERs issued 
during the analysis period to ensure that the SEC has not cited the potential control firm. 
The final non-fraud sample consists of 87 firms. Table 2 indicates that fraud and non-fraud 
firms do not differ significantly based on net sales, market value, or exchange listing.^ 


IV. RESEARCH DESIGN AND EMPIRICAL RESULTS 
Changes in Governance Variables 


Prior research (Beasley 1996; DSS) has established that, compared to matched control 
firms, fraud firms have weaknesses in several key governance mechanisms. These weak- 
nesses include boards with lower percentages of outside directors, a higher proportion of 
firms with the combined CEO/COB position, a lower proportion of firms with audit com- 
mittees, fewer audit committee meetings, and a smaller percentage blockholder ownership. 
Other governance mechanisms likely to be important to the financial reporting system 
include the composition of the audit committee, including its percentage of independent 
directors and financial experts, the quality of the external audit firm, and the percentage of 
ownership by managers and directors. 

Several empirical studies show that boards composed mainly of outside directors are 
more effective than boards composed mainly of insiders (e.g., Brickley and James 1987; 
Weisbach 1988; Rosenstein and Wyatt 1990), and that boards are ineffectual monitors when 
the board's equity ownership is small and when the CEO is also Chairman of the Board 
(Jensen 1993). Agrawal et al. (1999) provide the only empirical study of which I am aware 
that investigates changes in both senior management and directors after fraud detection. 


* [also test for the difference in age between fraud and control firms and find no statistical difference. 
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TABLE 2 
Matching Statistics for Fraud and Control Firms 
Control Control 
Fraud Firm Firm Fraud Firm Firm 
Mean Mean Median Median 
(m millions) (in millions) (in millions) (in millions) 
Variable (n = 87) (n = 87) t-statistic (n = 87) (n = 87) p-value 
Net Sales 373.61 343.59 0.77 37.07 44.85 0.28 
Market Value 336.59 315.94 0.40 55.48 31.39 0.14 
Control 
Exchange Listing Fraud Firm Firm 
NYSE 17 17 
AMEX 5 5 
NASDAQ/OTHER 65 65 
Total 87 87 


Fraud firms are matched with control firms on the basis of year, net sales, SIC code, and stock exchange listing. 
The t-statistic is for the difference between the means of the matched pairs. The p-value is for the Wilcoxon 
signed rank test. 


They find no significant turnover in either senior management or directors in the three years 
after fraud detection. However, it is difficult to draw inferences from their study about the 
detection of GAAP violations as a catalyst for making improvements in governance because 
their sample included only four GAAP violators. 

Sommer (1991) suggests that having an audit committee as part of the governance 
structure and having an effective audit committee are two different matters; further, the 
audit committee needs to meet on a regular basis without management in order for it to be 
effective. He also notes that the audit committee, because it has constant access to the 
internal audit staff, external audit firm, and other corporate personnel, is likely to be the 
first to identify a potential irregularity. His study supports the notion that there is a positive 
relation between the number of meetings an audit committee holds and its effectiveness. 
Beasley (1996), however, finds no significant difference between fraud and non-fraud firms 
in the number of audit committee meetings held in the year prior to fraud detection. The 
Blue Ribbon Committee on Improving the Effectiveness of Corporate Audit Committees 
(BRC) (1999) suggests that the composition of the audit committee is also likely to be 
critical to its performance. I therefore examine whether the composition of the audit com- 
mittee differs for fraud and control firms along the dimensions of size, number of outside 
directors, and number of financial experts. 

The extant literature finds no significant difference between fraud and non-fraud firms 
in audit-firm quality as measured by the use of Big 4 auditing firms versus non-Big 4 firms 
in the year prior to fraud detection (DSS; Beneish 1997). Several studies find an association 
between audit-firm quality and audit-firm size (e.g., DeAngelo 1981). One implication of 
this finding is that a larger audit firm is likely to be a stronger monitor than a smaller one. 
Palmrose (1988) suggests that (then) Big 8 audit firms are sued less frequently than other 
audit firms because they provide higher quality audits. DeFond (1992) finds a positive 
relation between audit-firm quality, as proxied by audit-firm size and name brand (i.e., Big 
4), and its ability to mitigate agency problems. In sum, these findings suggest that Big 4 
audit firms provide better monitoring than other categories of audit firms. 
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Firms choosing governance mechanisms likely also evaluate their costs. Empirical stud- 
ies find that internal control improvements are costly to firms in terms of the time and 
effort required to improve governance (Yermack 1996; Klein 2002). Moreover, there is a 
prohibitive cost associated with establishing internal controls that would eliminate entirely 
the probability of (future) frauds (Jensen 1993). Notwithstanding these costs, because of 
the severity of the agency problems of the fraud firms used in this study, it is likely that 
their costs of not improving governance are greater than their costs of doing so. Economic 
theory suggests that efforts to repair the financial reporting system must necessarily include 
significant improvements in its monitoring components. More specifically, agency theory 
suggests that the demand for monitoring is positively related to the prevalence of agency 
costs (Fama 1980; Fama and Jensen 1983). Because fraud is a manifestation of acute 
agency costs, I expect that fraud firms will experience a greater increase in: (1) outside 
director percentage, (2) audit committee activity, (3) audit committee independence, (4) 
audit committee financial expertise, and (5) audit-firm quality; and a greater decrease in 
the proportion of firms with the combined CEO/COB position after fraud detection than 
will their corresponding control firms over the same period. Blockholders are also likely to 
play a key monitoring role (Jensen 1993), but because they are an external monitoring 
mechanism, it is difficult to predict changes in their post-fraud. detection ownership. It is 
also difficult to predict the change in inside ownership by management and directors given 
that there is likely to be substantial turnover in both following the detection of fraud. I 
therefore do not make predictions for changes in the holdings of blockholders, managers, 
or directors. 

- I obtain data on these governance variables from the firm's proxy statement or SEC 
Form 10K. Figure 1 illustrates the direction of the changes in the governance variables for 
the sample of fraud firms from the year prior to fraud detection compared to the third year 
afterward. Figure 1, Panel A indicates that increases occur in the percentage of outside 
directors? (OutsideDir%), and the percentage of stock held by blockholders (Block%), while 
the percentage of firms with the combined CEO/COB position (CEO=COB%) and the 
percentage of stock held by management and directors (InsideOwn%) declines. Panel A 
also indicates that the percentage of firms with a Big 4 audit firm (Big4%) and the per- 
centage of institutional ownership (InstOwn%) remain virtually the same. 

Figure 1, Panel B provides evidence that fraud firms increase the number of outside 
directors on their boards (#OutsideDir), the number of audit committee meetings 
(#AudComMeet), the number of audit committee members (#A4udComMbrs), and the num- 
ber of outside directors on their audit committees (AudComOutsideDir). This figure also 
indicates that the number of financial experts? (#FinlExperts) remains virtually unchanged. 

Table 3 provides tests of significance of these governance changes using a set of control 
firms. I first test whether the sample of fraud firms used in this study has a similar gov- 
ernance profile to that used in prior studies. I then test my predictions about changes in 
these governance variables. As reported in Table 3, Panel A, I find that the mean 
OutsideDir?o in the year preceding fraud detection (Initial) is 47 percent for fraud firms 
compared to 56 percent for control firms, a statistically significant difference (t = —2.54), 
which is consistent with Beasley (1996) and DSS. Panel A also documents that ZOutsideDir 
in Initial is 3.3 for fraud firms compared to 4.1 for control firms, a statistically significant 


* Consistent with Rosenstein and Wyatt (1990), I define an outside director as a director who is not a present or 
former employee of the firm and whose only formal connection to the firm is his/her duty as a director. 

° I define a financial expert as one who has accounting or related financial management expertise (i.e., a person 
who is or has been chief financial officer, or other senior corporate officer with financial oversight responsibility). 
This definition is consistent with that in the report of the Blue Ribbon Committee (1999). - 
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FIGURE 1 l 
Governance Variables for Fraud Firms in the Year Prior to Fraud Detection (Initial) and 
the Third Year Afterward (Final) for Frauds Detected from 1982 to 2000 


Panel A: Means of Governance Variables m Percentage (sample size = Initial/Final) 


v 


oosSe6588838568s 





OutsideDir?; ^ Big496 CEO=COB% Block% InstOwn?6 — InsideOwn% 
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Panel B: Means of Governance Variables in Levels (sample size = Initial/Final) 


Elinitial Year 
Wi Final Year 


Level 





OutsideDir% = outside director percentage; 
Big4% = percentage with Big 4 audit firms; 
CEO=COB% = percentage with the combined CEO/COB position; 
Block% = percentage of shares held by > 5 percent blockholders; 
InstOwn% = the percentage of shares held by institutions that file SEC Form 13F; 
InsideOwn% = the percentage of shares held by management and directors; 
#OutsideDir = number of outside directors; 
#Directors = number of directors; 
#AudComMeet = number of audit committee meetings; 
#AudComMbrs = number of audit committee members; 
#AudComOutsideDir = number of outside directors on audit committee; 
#FinlExperts = number of financial experts on audit committee; 
Initial Year = the year prior to fraud detection; and 
Final Year — the third year after fraud detection. 


difference (t = —2.08). The change in OutsideDir% is greater for fraud firms in years 1 
through 4, thus providing support for the hypothesis that fraud firms will increase their 
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percentage of outside directors by more than control firms after fraud detection." The change 
in ZOutsideDir is also generally greater for fraud firms than for control firms over the 
analysis period, although this difference is statistically significant only after year 2. Over 
the analysis period, the change for ZDirectors is generally not different for fraud and control 
firms, except for year 1, where fraud firms have a larger decline than their corresponding 
control firms (t — —2.41). These results suggest that fraud firms improve their reputational 
capital through the addition of outside directors, both in number and proportion, although 
the decline in the total number of directors accounts for part of this result.’ 

As reported in Table 3, Panel B, I find that the mean #AudComMeet for fraud firms 
in Initial is 1.61, which is statistically smaller than the mean of 1.97 for control firms 
(t = —1.46). This result contrasts with that of Beasley (1996) and likely obtains due to 
more power in my tests—69- matched pairs in the current study versus 26 matched pairs 
in the Beasley (1996) study. 

Table 3, Panel B also provides the cumulative numerical increase in ZAudComMeet 
from its level in Initial to the fourth year afterward. The change in the mean #AudComMeet 
is greater for fraud firms than for control firms over the entire analysis period, thus pro- 
viding support for the hypothesis that fraud firms increase the number of audit committee 
meetings by more than control firms. The mean #AudComMeet at Final for fraud firms is 
3.00 versus 2.33 for control firms. This difference is statistically significant at the 10 percent 
level (one-tailed). Overall, fraud firms experience an increase of some 80 percent in 
#AudComMeet from the pre-fraud detection year to the fourth year afterward. It appears 
that in the wake of fraud detection the audit committee has at least visibly demonstrated 
that it is paying more attention to the financial reporting process. 

Results provided in Table 3, Panel B also indicate that fraud firms have significantly 
fewer ZFinlExperts in Initial, with 0.46 financial experts for fraud firms versus 0.80 financial 
experts for control firms (t = —2.40). This difference, however, persists at the end of the 
analysis period. I also find that while fraud firms have a majority of #AudComOutsideDir 
in the pre-fraud detection year, there is no difference between fraud and control firms in 
#AudComOutside in Initial or in any year through the fourth year afterward. One reasonable 
interpretation of these results is that although fraud firms' audit committees contained a 
majority of outsiders and some level of financial expertise, it appears that they did not pay 
enough attention to the financial reporting process, as evidenced by the fewer number of 
meetings held in Initial. Therefore, the number of audit committee meetings appears to be 
relatively more important than the composition of the audit committee for explaining the 
likelihood of committing fraud. 

Results in Table 3, Panel C indicate that in the year prior to fraud detection 75 percent 
of fraud firms retain a Big 4 auditing firms compared to 85 percent for control firms, à 
statistically smaller percentage at the ten percent level (t — —1.82).? This result contrasts 
with those in DSS and Beneish (1997) and likely obtains because the severity of the frauds 
examined in this study likely exceeds the severity of the frauds examined in their studies 
in terms of the SEC rule violated. Untabulated results indicate that at the third year after 
fraud detection the difference between fraud and control firms relative to their use of Big 4 


~d 


Results are qualitatively similar for tests of all governance variables holding constant the sample size at Final. 
The possibility exists that the results obtained for changes in outside director percentage are due to mean 
reversion. However, changes in outside director percentage require some explicit action by management and 
therefore are not like other variables, such as returns, where changes can occur without actions directed by 
management. I thank a referee for pointing this out to me. 

? This result suggests that there is relatively little room for monitoring improvement along this dimension for 
most fraud firms. Therefore, I exclude audit-firm changes from subsequent market-based analyses. 
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audit firms persists, with 75 percent of fraud firms retaining Big 4 firms versus 88 percent 
of the control firms, a statistically significant difference (p = 0.03). Overall, fraud firms 
are more likely than control firms to switch audit firms after fraud detection, but not audit- 
firm types. In the year of fraud detection, 25 percent of fraud firms switch audit firms, 
compared to 9 percent of control firms, a statistically significant result (p — 0.00). In the 
year after fraud detection, 24 percent of fraud firms switch audit firms, compared to 7 
percent of control firms, again statistically significant (p — 0.00). For fraud firms, the 
majority of switches occurs within Big 4 firms, with 55 percent and 62 percent of the 
switches in the fraud detection year and year afterward, respectively. These results imply 
that in the post-fraud detection period fraud firms were at least able to maintain a high 
level of audit-firm quality, as measured by Big 4 audit firms. 

Panel C of Table 3 also indicates that in Initial, 94 percent of fraud firms have 
CEO=COB% compared to 80 percent for control firms, a statistically greater proportion 
(t = 2.79). This result is consistent with that in DSS, who find that 86 percent of fraud 
firms have the combined position compared to 74 percent of control firms in the year prior 
to fraud detection. I find that by the end of the analysis period fraud and control firms have 
an indistinguishable difference in this variable. Panel C also reports that fraud firms have 
a significantly lower Block% in Initial (t = —2.05) compared to control firms, consistent 
with the finding in DSS. This difference disappears by the end of the analysis period, 
indicating that blockholders increase their holdings in fraud firms. I also find that while 
fraud firms and their matched control firms do not differ in InstOwn% in Initial, fraud firms 
experience a greater decrease in InstOwn% for each year of the analysis period, but do not 
differ in InstOwn® at the end of the analysis period. The sample size for this variable is 
very small and one therefore needs to carefully interpret these results. Finally, I find no 
difference in InsideOwn% between fraud and control firms in Initial or in any of the four 
years afterward. 

In sum, I find that, in the year prior to fraud detection, fraud firms exhibit weakness 
along the governance dimensions of outside director percentage, the number of outside 
directors, the number of audit committee meetings, the number of financial experts on their 
audit committees, the quality of the external audit firm, the proportion of firms with the 
combined CEO/COB position, and the percentage of blockholder ownership. More inter- 
estingly, I find that by the end of the three years following fraud detection, fraud firms 
have indistinguishable differences from their matched control firms in outside director per- 
centage and the proportion with the combined CEO/COB position; they also experience a 
greater number of audit committee meetings than their matched control firms. Overall, this 
evidence is consistent with my hypotheses and likely indicates that fraud firms wish to 
report their financial information more credibly. Whether this results in the restoration of 
fraud firms' reputations with informed capital market participants and the stock market are 
issues that I explore next. 


Responses to Governance Changes 

Overall, the results presented thus far in this paper indicate that fraud firms experience 
an increase of less than one outside director and an increase of one audit committee meeting 
in the three years after fraud detection. It is not clear how to interpret the economic sig- 
nificance of these results. These increases likely have a second-order effect on the resto- 
ration of the firms' reputations, whereas improving the underlying economics of these firms 
likely has a first-order effect. In this section, I attempt to sort out these effects and examine 
whether there is a relation between governance improvements and the responses of informed 
capital market participants and the stock market. 
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Informed Capital Market Participants 

In this subsection, I investigate whether improvements in (1) outside director percent- 
age, (2) the number of outside directors, and (3) audit committee activity affect informed 
capital market participants, such as analysts, institutional investors, and short sellers. Spe- 
cifically, I test for a relation between governance improvements and whether: (1) analyst 
following and institutional ownership increase and (2) short sales decrease. 

I obtain data on analyst following from I/B/E/S and data on institutional holdings 
from the CDA Investment Technologies Spectrum database, which is derived from the SEC 
Form 13-F disclosure forms reported quarterly to the SEC. I hand-collect monthly short 
interest data from the Standard & Poor's Daily Stock Price Record, except as noted for 
NASDAQ firms for the period subsequent to 1987. For NASDAQ firms, I obtain short 
interest data for the years subsequent to 1987 directly from the NASDAQ. 

It is possible that the level of governance improvements affects informed capital market 
participants. I therefore divide fraud firms into two groups, one for changes greater than 0 
in outside director percentage, the number of outside directors, and the number of audit 
committee meetings from the pre-fraud detection year to the fourth year afterward; and one 
for those changes less than or equal to 0 over the same period. I then compare the means 
of the associated changes in the levels of analyst following, institutional ownership, and 
short sales for the two groups. Results in Table 4 indicate that changes in each of the 
governance variables do not affect analyst following, institutional ownership, or short sales. 

One possible explanation for the result with respect to institutional investors is that the 
governance improvements are only a necessary condition for attracting large investors and, 
given the severity of the frauds and the related fall-out, these investors are likely to be 
precluded from investing in these firms because of their fiduciary responsibilities to only 
invest in sound firms. It is likely, therefore, that the lingering effects of the fraud have kept 
institutional investors from increasing their ownership in the fraud firms. It logically follows 
that the lack of institutional interest likely affects analysts' decisions regarding coverage of 
these firms. One plausible explanation for the lack of result with respect to short interests 
is that these positions are likely to close out when the stock hits bottom, which is likely 
be around the time of fraud detection. I next investigate whether governance improvements 
are related to changes in the market values of the fraud firms. 


Stock Returns 

Prior research finds negative abnormal returns in a three-day window surrounding the 
announcement date of fraud (e.g., Feroz et al. 1991; DSS). If governance improvements 
are viewed as credible signals, then they ought to positively affect firm value through a 
decrease in the cost of equity capital. Given the difficulties (e.g., missing forecasts) of 
directly estimating changes in cost of equity capital associated with governance improve- 
ments, I take an indirect approach, similar in spirit to that used in DSS, and proxy for it 
using buy-and-hold abnormal returns.'? That is, increases in returns would be consistent 
with a decline in the cost of equity capital because returns directly affect the cost of equity 
capital. | 

The results in Table 3 indicate that, for fraud firms, the number of outside directors, 
outside director percentage, and the number of audit committee meetings are weak relative 
to their corresponding control firms in the pre-fraud detection year, but strengthen by the 


10 fn subsequent analyses, I use ROA as a proxy for expected cash flows because ROA could be driving the returns. 
I also test for a relation between changes in governance and corresponding changes in bid-ask spread (arguably 
a proxy for cost of equity capital), but fail to find one. 
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TABLE 4 
Association of Mean Changes for Fraud Firms in Outside Director Percentage, Number of 
Outside Directors, and Number of Audit Committee Meetings with Mean Changes in 
Institutional Ownership, Short Sales, and Analyst following from the Year Prior to Fraud 
Detection to the Third Year Afterward 


Panel À: Analysis Based on Mean in Outside Director Percentage 


t-stat for 
Change > 0 Change = 0 Difference 
(n) (n) (p-value) 
Outside Director Percentage 22.20 —14.83 9.10 
(55) (27) (« 0.001) 
Institutional Ownership —0.01 —0.01 0.47 
(20) (15) (0.64) 
Short Sales —0.01 0.00 —0.97 
| (22) (9) (0.34) 
Number of Analysts —3.09 —1.81 —0.98 
(55) (27) (0.33) 
Panel B: Analysis Based on Mean Changes in the Number of Outside Directors 
Number of Outside Directors 2.22 —0.93 8.97 
(37) (45) (< 0.001) 
Institutional Ownership 0.00 —0.01 0.21 
(16) (19) (0.83) 
Short Sales —0.01 —0.01 —0.24 
(11) (13) (0.81) 
Number of Analysts —2.68 —2.61 —0.05 
(36) (46) (0.96) 
Panel C: Analysis Based on Mean Changes in the Number of Audit Committee Meetings 
Number of Audit Committee Meetings 2.52 —0.47 5.37 
(34) (18) (« 0.001) 
Institutional Ownership —0.01 0.00 —0.87 
(16) (7) (0.39) 
Short Sales —0.01 0.00 — 0.32 
(16) (8) (0.75) 
Number of Analysts —2.00 —3.84 1.18 
(34) (18) (0.24) 


end of the fourth year afterward.!! To investigate whether these improvements are associated 
with future returns, I first compare the magnitude of changes in these variables with sub- 
sequent changes in buy-and-hold returns (BHAR) using the CRSP equal-weighted index for 
the three-year period subsequent to fraud detection (see below for details on calculation of 
BHAR). I obtain returns data from CRSP. Figure 2 provides a histogram that indicates that 
fraud firms that make positive changes in outside director percentage, the number of outside 
directors, and the number of audit committee meetings have significantly (p = 0.05) less 


1! While the percentage of firms with the combined position of CEO/COB is also initially weak, only 11 out of 
58 firms combined the positions of CEO and COB after the fraud detection year and 47 of the 58 firms 
experienced no change. Therefore, I exclude this variable from further analysis. 
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FIGURE 2 
Buy-and-Hold Abnormal Returns for the Three Years after Fraud Detection and the 
Associated Governance Changes for Fraud Firms from the Year Prior to Fraud Detection 
to the Third Year Afterward for Frauds Detected from 1982 to 2000 


[Bousiseberd  Becusceni] Garudcomles 
AT s i Bu 
B EN E 


[s] Change > 0 
li Change x 0 


Three-Year BHAR 





Three-Year BHAR = the mean three-year buy-and-hold abnormal return for the sample of fraud firms for the 
three years after fraud detection using the CRSP equal-weighted index; 
AOutsideDir% = change in outside director percentage from the year prior to fraud detection to the third 
year afterward; 
AfOutsideDir = in in number of outside directors from the year prior to fraud detection to the third 


year afterward; 
A#AudComMeet = change in number of audit committee meetings from the year prior to fraud detection to the 
third year afterward; 
Change > 0 = positive governance changes from the year prior to fraud detection to the third year 
; and 
Change = 0 = nonpositive governance changes from the year prior to fraud detection to the third year 
afterward. 


negative BHAR than do firms that make non-positive changes.'? Because the percentage of 
outside directors and the number of outside directors are likely to be highly correlated and 
because the effect of increasing the percentage of outside directors has a larger impact on 
future returns than does increasing the number of outside directors, I exclude the change 
in the number of outside directors from the regression analysis below. 

I analyze more rigorously the relation of improvements in the quality of the board of 
directors and audit committee activity with abnormal returns over the three-year period 
following fraud detection by estimating the following regression: 


BHAR,, = a + B,AOutsideDir%,, + B,A#AudComMeet,, + BAROA,, 
+ B,BMV,, + BSMVE,, + e, , (1) 


where: 


'2 T also use ROA as a measure of future performance, but fail to find a significant relation between changes in 
the governance variables mentioned above and ROA (results not presented). 
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BHAR,, = buy-and-hold abnormal return for firm i at time t, where t is as defined 
below; 

AOutsideDir%,, = lagged change in outside director percentage from the pre-fraud de- 
tection year to the fourth year afterward for firm i at time t, where t 
is as defined below; 

A#AudComMeet,, = lagged change in the number of audit committee meetings from the 
pre-fraud detection year to the fourth year afterward for firm i at time 
t, where t is as defined below; 
AROA,, = change in net income scaled by total assets over the three years fol- 
lowing fraud detection; 

BMV,, = book value per share divided by the market value per share for firm i 
at time t, where ¢ is as defined below. Book and market value are 
measured as of the end of the year prior to the analysis year (Source: 
Compustat-book value, and CRSP-market value); 

MVE,, = log of the market value of equity for firm i at time #, where t is as 
defined below. Market value is measured as of the end of the year 
prior to the analysis year (Source: CRSP); and 

e,, = the residual for firm i at time t, where t is as defined below. 


I compute long-run buy-and-hold abnormal returns (BHAR) for fraud firm i and control 
firm/portfolio m as follows: 


4 

R,, = H d dnd Q) 
4 

R,-ll 52-1 (3) 
r=? 


where r,, and r,,, are the annual returns of fraud firm i and control firm/index m, respec- 
tively, on date z. The BHAR for each fraud firm is computed as the difference between the 
buy-and-hold returns of the fraud firm and its respective control firm/index: 


BHAR,, = R,, — R,, (4) 


The BHARS are computed with respect to (a) control firms, (b) CRSP equal-weighted 
index, (c) CRSP value-weighted index, and (d) CRSP S&P 500 Index. Additionally, in 
order to compute BHAR for the three years after fraud detection, only those firms with 
CRSP data for this three-year period are included in the following analyses. 

Panel A of Table 5 reports descriptive statistics for these variables and for raw returns 
for fraud and control firms and the CRSP equal-weighted index for the three years following 
fraud detection. This panel indicates that fraud firms experience a mean (median) raw return 
of —22 percent (—32 percent), compared to mean (median) raw returns for control firms 
and the CRSP equal-weighted index of 39 percent (0.00 percent) and 31 percent (26 per- 
cent), respectively. These findings illustrate that while fraud firms experience negative ab- 
normal returns over the analysis period, they also experience negative raw returns. This 
result is consistent with that provided in Figure 2. ` 
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TABLE 5 
Descriptive Statistics and Pearson Pairwise Correlations for Returns, Change in Net Income, 
Governance Changes, and Control Variables for Fraud Firms for the Three-Year Period 





following Fraud Detection 
Panel À: Descriptive Statistics of Returns, Governance Variables, and Control Variables 
(n — 34) 

First Third 
iro, E nn Bu Dev Median uarie Vuartile 
BHAR —0.53 0.70 —0.83 — 1.10 0.02 
Fraud..Return ~0.22 0.69 0.32 —0.71 0.52 
Control_ Return 0.39 1.35 0.00 —0.43 0.56 
EW..CRSP 0.31 0.27 0.26 0.12 0.53 
AROA —0.14 0.60 0.02 —0.08 0.12 
AOutsideDir% 6.86 12.74 5.35 0.00 12.94 
BMV 8.63 50.63 0.02 0.01 0.08 
MVE 4.40 1.76 4.12 3.40 5.34 


Panel B: Pearson Pairwise Correlations for Returns, Change in Net Income, Governance Changes, and 
Control Variables (n = 34) 


BHAR AROA AOutsideDir A#AudComMeet BMV — — MVE — Fraud Return 

BHAR 10090 14.69  22.6%*** 7.896 21.99p*** 7.996 92, 5*** 
AROA 10096 22.596 6.196 13.896 7.396 8.496 
AOutsideDir% 100% 7.7% —0.996 —6.896 31.8%* 
A#AudComMeet 100% 28.2%*** 9.5% 12.7% 
BMV 100% —44.9%*** 15.8% 
MVE 100% 13.5% 
Fraud ..Retur 10096 


* *** Significant at the 10 percent and 1 percent levels, respectively, based on two-tailed tests. 
BHAR = the buy-and-hold abnormal return for fraud firms for the three years following fraud 
detection using the CRSP equal-weighted index; 
Fraud... Return = the raw buy-and-hold return for fraud firms for the three years following fraud detection; 
Control... Return = the raw buy-and-hold return for control firms for the three years following fraud detection; 
EW_CRSP = the CRSP equal-weighted index return for the three years following fraud detection; 
AROA = the change in net income scaled by total assets over the three years following fraud 
detection; 
AOutsideDir% m the lagged change in outside director percentage from the pre-fraud detection year to the 
fourth year ; 
A¥AudComMeet = the lagged change in the number of audit committee meetings from the pre-fraud detection 
year to the fourth year afterward; 
BMV = the book value per share divided by the market value per share; and 
MVE = the log of the market value of equity. 


Panel B of Table 5 provides the Pearson pairwise correlations for the regression vari- 
ables and the raw returns for fraud firms (Fraud_ Return) over the three-year period fol- 
lowing fraud detection. For my sample of fraud firms, BHAR is positively correlated at the 
one percent level with AOutsideDir% (22.6 percent), BMV (21.9 percent), and Fraud_ 
Return (92.5 percent). AOutsideDir% is also positively correlated with Fraud_.Return (31.8 
percent) at the 10 percent level. Also, A£AudComMeet is positively correlated with BMV 
(28.2 percent) at the 1 percent level. However, AROA, A#AudComMeet, and MVE are not 
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significantly correlated with BHAR over the analysis period. In short, these pairwise cor- 
relations appear reasonable and do not seem large enough to create multicollinearity prob- 
lems in the regressions presented in Table 6.!? 

Table 6 presents the results of the regressions of buy-and-hold abnormal returns on the 
explanatory variables in Equation (1).'* The dependent variable is the buy-and-hold abnor- 
mal return for fraud firms for the three-year period following fraud detection based on the 
CRSP equal-weighted index. I obtain qualitatively similar results estimating these regres- 
sions using BHAR based on the CRSP value-weighted and CRSP S&P 500 indices, and 
control firm approach (results not tabulated). I use lagged governance variables because 
most of the changes in the governance variables take place by the end of the second year 
after fraud detection (see Table 3). Results in Table 6 confirm the relative importance of 
improvements in outside director percentage in explaining abnormal returns. In Model 1, 
the coefficient on AOutsideDir?o is positive and statistically significant at the 5 percent 
level (t = 2.32), whereas the coefficient on AZAudComMeet is negative, but not statistically 
significant. This finding provides evidence that the market values increases in board inde- 
pendence, and is consistent in spirit with the results in Rosenstein and Wyatt (1990), but 
is ten times the economic significance of that found in their study. The coefficient on 
AOutsideDir% is 0.022, which implies that a 1 percent increase in outside director per- 
centage is associated with a 2.2 percent increase in BHAR, an economically significant 
result. The coefficients on BMV and MVE are not statistically significant. Another interest- 
ing finding from Model 1 is that the adjusted R? is 18.50 percent. This very high R? is 
possibly due to AOutsideDir?o being a substitute for changes in earnings. 

Therefore, in Model 2, I test for the significance of AROA in explaining returns. Results 
for Model 2 indicate that the coefficient on AROA is not statistically significant, with an 
adjusted R? of 0.00 percent. In Model 3, I test for the incremental explanatory value of 
AOutsideDir% over AROA. The coefficient on AOutsideDir® is again statistically signifi- 
cant, but at a slightly lower level of significance (t = 2.2, 5 percent level). Results also 
indicate that while the coefficient on AROA is positive, it is again statistically insignificant. 
The overall explanatory power of Model 3 is slightly lower than that of Model 1, with an 
adjusted R? of 15.25 percent. In sum, the results from this section are consistent with the 
notion that improvements in outside director percentage following fraud detection are 
strongly linked to returns over the same period. Moreover, changes in net income over the 
same period provide only marginal explanatory value with respect to returns. 

It is possible that the results obtained for Models 1 and 3 are driven by the correlated 
omitted variable of survivorship bias. That is, the results are likely to be overstated 1f poorly 
performing firms are more likely to drop out of the analysis than are those firms that are 
performing well (or less poorly), and. if those poor performers are in the group with the 
smallest governance change. To test this possibility, I perform a binomial proportions test 
to determine whether there is a differential survivorship rate for firms in the group with the 
largest increase in the percentage of outside director representation and for those firms in 
the group with the smallest such increase. I partition the fraud sample into three equal 
groups based on the frequency distribution of changes in outside director percentage from 


13 I also conduct more formal tests of collinearity on the regressions presented in Table 6. The highest condition 
index for any variable is 7.8 (index values above 30 would indicate a collinearity problem). 

14 I also estimate regressions that include other controls for firm performance—lagged AROA, AEPS, and lagged 
AEPS—with no qualitative difference in the results. I also include variables that interact a yearly dummy with 
AOutsideDir?6 to assess whether the speed with which governance changes were made had a differential impact 
on returns. The coefficients on the interaction terms are zero, with no difference among any of the years (results 
not tabulated). 
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TABLE 6 
Regressions of Fraud Firms’ Long-Run Buy-and-Hold Abnormal Returns for the Three Years 
after Fraud Detection on Governance Changes from the Year Prior to Fraud Detection 


to the Third Year Afterward 
Dependent Variable: BHAR 
(t-statistic) 
Variable (1) (2) (3) 
Intercept — 1.03 —1.02 —1.01 
(—2.57)** (—2.26)** (-2.38)** 
AOutsideDir96 0.022 0.022 
(2.32)** (2.19)** 
A#AudComMeet —0.05 —0.05 
(—1.47) (—1.46) 
AROA 0.11 0.04 
(0.47) (0.20) 
BMV 0.86 1.83 0.83 
(0.68) (1.35) (0.64) 
MVE 0.10 0.09 0.10 
(1.33) (1.08) (1.26) 
n 34 34 34 
Adj. R? 18.5096 0.00% 15.25% 


** Significant at the 5 percent level, based on two-tailed tests. 
BHAR m the buy-and-hold abnormal return for the three years following fraud detection using the 
CRSP equal-weighted index; 
AOutsideDir% m the lagged change in outside director percentage from the pre fraud detection year to the 
fourth year afterward; 
AfAudComMeet = the lagged change in the number of audit committee meetings from the pre-fraud detection 
year to the fourth vear afterward; 
AROA = the change in net income scaled by total assets over tbe three years following fraud 
detection; 
BMV = the book value per share divided by the market value per share; and 
MVE = the log of the market value of equity. 


the pre-fraud detection year to the fourth year afterward. Results (untabulated) indicate that 
32.14 percent of fraud firms with the largest increase in outside director representation drop 
out of the analysis versus 26.92 percent of those firms with the smallest such increase. This 
difference is not statistically significant (Z = 0.42, p = 0.67). Additionally, I test whether 
the change in outside director percentage and abnormal returns are statistically different for 
surviving firms and those that dropped out of the analysis. Results (untabulated) of a stan- 
dard t-test indicate that there is no difference in either the increase in outside director 
percentage or abnormal returns for these firms (t = —0.50 p = 0.62 and t = 0.30 p = 0.76, 
respectively). 

The results from the regressions need to be interpreted with caution. Analyses based 
on long-run buy-and-hold abnormal returns are fraught with well-established econometric 
and interpretation issues.!? However, this paper does provide some evidence consistent with 


13 [ considered conducting a short-window event study to examine the market’s response to outside director ap- 
pointments, but because only a scant number of director appointments ior my sample firms are disclosed prior 
to the issuance of proxy statements, I am unable to identify a sufficiently large enough sample to execute such 
a study. 
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the notion that increasing outside director percentage subsequent to fraud detection yields 
economic benefits in the form of higher abnormal returns. 


V. SUMMARY AND CONCLUSIONS 

This is the first study of its kind that provides evidence on the link between the cred- 
ibility of the financial reporting system and the quality of governance mechanisms by 
investigating changes in these mechanisms subsequent to fraud detection and the corre- 
sponding economic consequences of such changes. It uses a sample of 87 firms identified 
by the SEC as fraudulently manipulating their financial statements. Consistent with prior 
research, results indicate that fraud firms have poor governance relative to a control sample 
in the year prior to fraud detection. Specifically, fraud firms have fewer numbers and per- 
centages of outside board members, fewer audit committee meetings, fewer financial experts 
on the audit committee, a smaller percentage of Big 4 auditing firms, and a higher per- 
centage of CEOs who are also chairmen of the board of directors. However, the results 
indicate that fraud firms take actions to improve their governance and that three years after 
fraud detection these firms have governance characteristics similar to the control firms in 
terms of the numbers and percentages of outside members on the board, but exceed the 
control firms in the number of audit committee meetings. This study also investigates 
whether improved governance influences informed capital market participants. The results 
indicate that analyst following and institutional holdings do not increase in fraud firms, 
suggesting that credibility was still a problem for these firms. However, the results also 
indicate that firms that take actions to improve governance have superior stock price per- 
formance, even after controlling for earnings performance. 

This study makes several contributions to the accounting literature. It adds to our knowl- 
edge of the link between the credibility of the financial reporting system and quality of 
governance mechanisms by extending research on the causes and consequences of financial 
reporting failures (Beasley 1996; DSS; Agrawal et al 1999; Beneish 1999). This study also 
sheds light on the efficacy of a post-fraud detection recovery strategy that involves gover- 
nance improvements by providing evidence that there are positive economic benefits that 
accrue to fraud firms that improve the independence of their boards. Finally, this study 
provides insights into the agency problem by utilizing a unique setting that provides a rare 
opportunity to observe agency costs directly and to study the governance measures that 
firms implement in an attempt to reduce these costs. 

My research design, which is intended to include firms most likely to exhibit an increase 
in governance quality following the detection of financial reporting fraud, necessarily fo- 
cuses on the most egregious GAAP violators. An issue to be resolved with future research 
is whether the inferences of this study are generalizable to samples of firms with a broader 
class of financial reporting credibility problems (e.g., earnings restatements). 
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ABSTRACT: Employment relationships provide fertile ground for both employee and 
employer opportunism. Employers worry about whether employees will devote suffi- 
cient effort to work, and employees are concerned about v/hether employers will com- 
pensate them appropriately. In this paper, we examine whether employer discretion 
over the size of the total employee compensation pool and the allocation of this pool 
among employees influences employee and employer opportunism. The results of our 
experiment indicate that firm output and employees' compensation are greater when 
the employer does not have discretion over total employee compensation, but does 
have discretion over the allocation of total compensation. We find that the employer's 


` residual profit increases with discretion over the allocation of compensation among 
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employees; however, we find no effect on residual profit of the employer's discretion 
over the total amount of employee compensation. Our results suggest that firms benefit 
from a compensation contract that establishes total employee compensation as a pre- 
determined function of public, aggregate measures such as accounting income, but 
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provides the employer at least some discretion to allocate this compensation using 
private information. However, our results caution that employees and employers may 
not have similar preferences for the degree of employer discretion over the determi- 
nation of total employee compensation. 


Keywords: compensation; discretion; subjective performance evaluation; double-sided 
moral hazard. 


Data Avallability: Contact the authors. 


I. INTRODUCTION 
mployment relationships typically are characterized by two-sided concerns regarding 
opportunistic behavior (Demski 1997, 579). Employers are concerned about whether 
employees will work diligently, while employees worry about whether employers 
will reward this work appropriately. The potential for employees and employers to act in 
their own self-interest when making these choices threatens to destroy the cooperation 
needed for productive firm outcomes. 

Opportunism may not arise if workplace norms result in cooperation among employers 
and employees. However, firms typically do not rely solely on workplace norms to pre- 
empt opportunism, and implement control systems, such as performance evaluation and 
compensation systems, to mitigate opportunism. These systems govern two compensation 
decisions: (1) how to allocate total remuneration between the employer and employees and 
(2) how to allocate total employee remuneration among employees (Baiman and Rajan 
1995). The first decision establishes the size of the total employee compensation pool and 
the second determines the allocation of this pool among employees. The approach a firm 
uses to make these decisions can influence both employer and employee opportunism. 

In this paper, we examine whether an employer's discretion over the two compensation 
decisions affects employee and employer opportunism. The effect of employer discretion 
over the two compensation decisions is unclear because discretion creates tension with 
respect to employer and employee opportunism. An employer can use discretion to her 
personal benefit, increasing her share of total compensation at the expense of employees 
. or allocating compensation to “favored” employees.’ However, discretion also allows an 

employer to reduce employee opportunism by determining employee compensation using 
measures and approaches that best reflect employees' collective and individual efforts. 

The tension between employer discretion and employer/employee opportunism typi- 
cally is connected to firms' information systems. Firms' formal information systems, such 
as accounting systems, provide information that is publicly observable or verifiable but 
often reflects only aggregate employee performance (Baiman and Rajan 1995). Employers 
can alleviate employees' concerns about employer opportunism by giving up discretion and 
committing ex ante to a total compensation pool and its allocation among employees based 
on such public information. However, public information may not reflect all relevant aspects 
of performance. In particular, aggregate information provides little basis for employers to 
differentially reward individual employee performance and prevent employees from free- 
riding on the efforts of other employees. Thus, compensation approaches based on publicly 
available information may reduce employer opportunism but may not prevent employee 
opportunism. 


' Throughout the paper, we use feminine pronouns to refer to employers and masculine pronouns to refer to 
employees. 
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Firms also have informal sources of information. This information can reflect multiple 
aspects of aggregate performance and/or isolate the performance of individual employees, 
but is privately observed by the employer. An employer can reduce employees” opportunism 
by retaining discretion to use her private information in performance evaluation. For ex- 
ample, an employer can reduce the benefit of free-riding by using her private information 
on individual performance to allocate the compensation pool among employees. Unfortu- 
nately, allowing the employer discretion to use private information generates employee 
concerns that the employer will fail to reward employees appropriately for their aggregate 
or individual efforts. Thus, a compensation scheme in which an employer has full discretion 
to use private information may reduce employee opportunism but allows for employer 
opportunism. 

The relative strengths and weaknesses of restricting versus allowing employer discretion 
suggests that there may be complementary roles for discretion versus nondiscretion with 
respect to the two compensation decisions. Specifically, an approach in which the employer 
does not have discretion over the size of the total employee compensation pool (hereafter 
pool size) but does have discretion over the allocation of the employee compensation pool 
(hereafter pool allocation) may alleviate concerns about both employer and employee op- 
portunism (Batman and Rajan 1995). This approach suggests that there also are comple- 
mentary roles for public and private information— public information provides the basis for 
the total compensation pool and private information assists in the allocation of this pool. 

To investigate the effects of employer discretion on employee and employer opportun- 
ism, we examine situations in which the employer either has full discretion or no discretion 
over the size of the employee compensation pool and/or the allocation of this pool among 
employees. We use a laboratory experiment with a 2 (pool size discretion) X 2 (pool 
allocation discretion) X 8 (period) mixed design. In the experiment, the employer either 
has discretion to determine the size of the two-employee compensation pool (discretionary 
pool size, DS) or the pool is set at two-thirds of the revenue from total employee output 
(nondiscretionary pool size, NDS). Similarly, the employer either has discretion in allocat- 
ing the compensation pool between the two employees (discretionary pool allocation, DA) 
or the pool is split evenly between the two employees (nondiscretionary pool allocation, 
NDA). The employer receives the residual profit after subtracting employee compensation 
from total revenue. | . 

In the experiment, 237 undergraduate business students, in groups of three, acted either 
as an employer or as one of two employees. Employees allocated three minutes between a 
work task (decoding numbers into letters) and compensated leisure for each of eight periods. 
After each work session, employers in the discretionary conditions determined the size of 
the employee compensation pool and/or the allocation of this pool between the two em- 
ployees. Employers received noisy private information about individual employee output to 
use if desired in making their compensation decisions. Thus, in the discretionary conditions, 
employers had the opportunity to more closely align an employee's compensation with 
individual production than in the nondiscretionary conditions in which compensation was 
a multiple of total output and/or evenly split between the two employees. 

We consider the impact of employer discretion over pool size and pool allocation on 
total output, as well as the division of the value of total output (i.e., revenue) between 
employer remuneration and employee remuneration. The results indicate that both total 
output and employee compensation benefit from restricting the employer's discretion over 
the size of the compensation pool, and allowing the employer to have discretion over the 
allocation of this pool. Total output and employee compensation were greater when the 
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employer did not have discretion over pool size than when she did have discretion, as well 
when the employer had discretion over the pool allocation than when employees received 
equal compensation. These results hold even when we eliminate the last two periods to 
control for possible end-game effects. 

Our results for the effect of discretion on the employer's residual profit are mixed. Over 
the entire eight periods, the employer's residual profit did not differ between the discre- 
tionary and nondiscretionary pool size conditions. The employer's residual profit was 
greater in the discretionary pool allocation condition than in the nondiscretionary condition. 
However, we found no effect of either pool size or pool allocation discretion on employer's 
residual profit after eliminating the final two periods. This result suggests that discretionary 
pool allocation primarily mitigates employee opportunism that arises as the employment 
relationship nears its end. 

Our results also document a fairly high level of cooperative behavior between employ- 
ers and employees in the absence of controls that mitigate opportunism. In the DS-NDA 
contract, which provides no explicit controls to mitigate employee or employer opportun- 
ism, employees produced, on average, 75 percent of the amount produced in the best- 
performing NDS-DA contract. Additionally, employers with discretion over pool size paid 
out, on average, 59 percent of revenues as compensation to the two employees. However, 
the effects of opportunism associated with either a discretionary pool size or nondiscre- 
tionary pool allocation approach increased over the eight periods. For example, in the last 
period, output in the DS-NDA contract was only 45 percent of that in the NDS-DA contract. 
Moreover, employers in the two discretionary size conditions paid only 51 percent of rev- 
enue to employees in the last period. 

This study contributes to the control systems literature by examining implications of 
employer discretion in performance evaluation and compensation for employer and em- 
ployee opportunism. Our results document a fairly high level of productive output in the 
absence of specific controls related to employer discretion/nondiscretion intended to miti- 
gate employee and employer opportunism. Rather, employers and employees appear to 
recognize the economic benefits of joint cooperation and rely on social values, such as trust 
and reciprocity, to keep productive outcomes intact over all but the final periods of em- 
ployment (Axelrod 1984; Berg et al. 1995; Sprinkle 2003). These findings highlight that 
the efficacy and cost of performance evaluation and compensation arrangements should be 
compared not simply against each other, but against the cost and efficacy of naturally 
occurring social mechanisms in the workplace. 

Despite these cooperative outcomes, our study documents benefits of control systems 
with explicit incentives to mitigate employer and employee opportunism. Specifically, al- 
lowing the employer discretion to allocate compensation among employees increases firm 
output and both employee and employer remuneration. However, the implications of our 
results for discretion in setting the size of total employee compensation are less clear. While 
eliminating employer discretion over the pool size improves output and employee compen- 
sation, the employer's residual profit is not greater under the nondiscretionary pool size 
case than under the discretionary pool size case. In essence, the benefit to the employer of 
increased output associated with a nondiscretionary pool size is not sufficiently large to 
Offset the cost to the employer of having to pay greater per unit compensation to employees 
in our experiment. Thus, employers may prefer to retain discretion over both pool size and 
pool allocation, to the detriment of firm output and employee compensation. However, given 
the compensation-related benefits to employees of a nondiscretionary approach to pool size, 
competition among employers for employees could lead employers to relinquish their dis- 
cretion over compensation pool size. 
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Finally, our study complements research on the use of contractible (public) versus 
noncontractible (private) information in compensation contracts. Analytical research docu- 
ments the benefits of compensation contracts that base the total pool size on contractible 
information and the pool allocation on employer's private information (Baiman and Rajan 
1995). Our results generally provide empirical support for Baiman and Rajan's (1995) 
conclusion. Since contractible information often emanates from formal accounting systems, 
our findings also suggest that there are complementary roles in designing compensation 
systems for information from formal accounting systems and private information from a 
firm's informal information systems. 

The remainder of this paper is organized into four sections. The next section provides 
background and develops the hypotheses, and Section III describes the experiment used to 
test these hypotheses. Section IV presents the results, and Section V provides a summary 
and discussion of the results. 


IL BACKGROUND AND HYPOTHESES 
Employer and Employee Opportunism—-Discretionary Pool Size/Nondiscretionary 
Pool Allocation Contract (DS-NDA) 

We examine a setting in which employees work independently on a production task 
for a finite number of periods. Each employee allocates a fixed amount of time each period 
between work and leisure, and derives utility from both leisure time and compensation. 
Output for the group of employees is the sum of individual employee output. Firm revenue 
equals group output multiplied by the competitive market price. The employer is a residual 
claimant who receives total firm revenue less the amount paid as compensation to 
employees. 

The firm has an information system that measures employee performance and a com- 
pensation system that rewards employee performance (Zimmerman 2003, 169). To study 
our research question on the effects of employer discretion on opportunism, we vary char- 
acteristics of these two systems. We first examine a setting that provides no formal controls 
to mitigate either employer or employee opportunism and then add controls that potentially 
mitigate one or both types of opportunism. 

In our initial setting, the information system measures group output (e.g., revenue or 
income for the entire company or a division), but not individual employee output.^ The 
employer retains discretion to determine the size of employees' compensation pool. Given 
the lack of information on individual performance, the employer does not retain discretion 
over the allocation of the compensation pool. Rather, the pool is divided equally among 
the employees. We term this compensation system the discretionary size/nondiscretionary 
allocation (DS-NDA) contract. 

These choices for information and compensation systems represent company settings 
that provide fertile ground for both employer and employee opportunism. Employees have 
an incentive to act opportunistically and shirk in their work effort because their individual 
compensation is based on group performance. If an employee receives more utility from 
leisure than from the expected incremental compensation he would receive from working, 


? [n practice, employers often observe aggregate performance measures from firms' formal information systems, 
but can not disentangle individual employee performance due to complexities of the production process. In these 
settings, employers frequently use aggregated performance as a basis for incentive contracts and allocate total 
compensation in a manner unrelated to individual performance (Milgrom and Roberts 1992, 413). 
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then he may shirk and receive work compensation based on the efforts of his co-workers 
(free-ride).? 

Employers also have an incentive to act opportunistically given their discretion over 
the size of total employee compensation. Given a finite-period world, a self-interested em- 
ployer will pay no employee compensation in the last period since it only reduces her 
residual payment and has no incentive value for employees' future work-leisure allocation 
decisions (employees have already made all such decisions). If employees believe the em- 
ployer will not pay compensation in the final period, then they have no incentive to work 
and will allocate all time to leisure. Backward induction implies a Nash equilibrium in 
which the employer does not pay compensation and employees do not work in any period. 

Both employer and employee opportunism threaten to destroy productive outcomes that 
could benefit all firm members. This threat arises from employer discretion over total em- 
ployee compensation and employee discretion over work effort, combined with incentives 
for employers and employees to use their discretion for personal gain. 

The threat of opportunism may not materialize if employees and employers choose to 
cooperate with one another, either out of recognition of their mutual benefit or out of fear 
that others will retaliate against opportunistic behavior (Fehr and Gáüchter 2000). Such 
cooperative behavior requires trust that others will not act opportunistically. Research on 
trust games shows that individuals are willing to place themselves at risk by cooperating 
and trusting that other individuals will not exhibit self-interested behavior (Berg et al. 1995). 
If all employees initially trust that other employees will choose to work rather than to shirk, 
then a productive equilibrium can be sustained as long as all employees reciprocate. Indeed, 
Fisher et al. (2003) find that employees' work allocations and performance are significantly 
greater than zero in a setting that does not provide a compensation-based mechanism to 
mitigate employee opportunism.? 

Employees also must trust that the employer will not act in her own self-interest, but 
instead will reciprocate by compensating employees appropriately. This expectation is not 
without foundation, as research indicates that employers reward employees who exert high 
levels of effort (Fehr et al. 1997). Additionally, over time, an employer can develop a 
reputation for compensating employees that induces employees to work in advance of re- 
ceiving compensation (Bull 1987). 

Thus, it is possible that extreme opportunism will not arise in the DS-NDA contract 
due to social values such as trust and reciprocity. However, if these social values are not 
sustainable workplace norms, then other mechanisms are needed to mitigate employee and 
employer opportunism. The firm's performance evaluation and compensation system can 
serve as such a mechanism. In the next section, we use implications of a paper by Baiman 
and Rajan (1995) to consider how changes to the employer's discretion in setting employee 
compensation can help to mitigate employer and employee opportunism. Specifically, we 
examine three compensation contracts that change one or both aspects of the DS-NDA 


3 This is similar to public goods settings in which individuals have the incentive not to contribute to the common 
pool, but consume the benefits of contributions from other individuals (Ledyard 1995). l 
* For example, the employer knows that paying compensation in the next-to-last period cannot motivate employees’ 
effort for that period since production has already occurred, but could motivate employees' effort in the last 
period. However, the employer knows that employees will not work in the last period given that she can not 
commit to paying compensation. Thus, there is no incentive for her to pav compensation in the next-to-the-last 
iod. 


peri 
5 However, Fisher et al. (2003) include a production goal in their setting, which may act as a separate mechanism 
influencing employee behavior. 
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contract with respect to employer discretion/nondiscretion in determining the size of the 
employee compensation pool and/or the allocation of this pool among employees. 


Compensation Contracts to Mitigate Employer and Employee Opportunism 


Baiman and Rajan (1995) provide insights on features of compensation contracts that 
have potential to mitigate employee and employer opportunism. Baiman and Rajan (1995) 
(hereafter B-R) examine a contract in which the employer commits ex ante to determine 
the size of the employee compensation pool based on jointly observable (contractible) 
information, but does not commit to a specific pool allocation. Rather, the employer has 
access to private (noncontractible) information about an individual employee's performance 
and retains the right to use this information to allocate the compensation pool. B-R ana- 
lytically show that this contract results in a strict Pareto improvement over a compensation 
contract in which the employer commits to determining compensation using only jointly 
observable information. 

The findings in B-R highlight the importance of employer discretion for employee and 
employer opportunism associated with the DS-NDA contract. The optimal contract in B-R 
has two features that differ from the DS-NDA contract. First, in the B-R contract, the 
employer gives up discretion to set the compensation pool size ex post. Second, the em- 
ployer retains discretion to allocate the compensation pool among employees using her 
private information related to individual employee performance. Thus, B-R's analytical 
results suggest that employee and employer opportunism can be mitigated by choices related 
to employer discretion. We discuss these issues below for employee and employer 
opportunism. 


Mitigating Employee Opportunism—Discretionary Pool Size/Discretionary Pool 
Allocation Contract (DS-DA) 

In the DS-DA contract, the employer has complete discretion over both the amount of 
the compensation pool and the allocation of the pool among employees. While the DS-DA 
contract does not alleviate employer opportunism due to this discretion, the contract pro- 
vides an opportunity for mitigating employee opportunism. Specifically, if the information 
system is expanded to include the employer's private noisy signals about each employee's 
output (as in B-R), the employer can use her discretion to determine the allocation of 
compensation among employees based on private information about individual employee 
performance. By more closely aligning an employee's compensation with his individual 
performance, the employer reduces the benefits of shirking to the employee, which should 
increase his work efforts. 


Mitigating Employer Opportunism—Nondiscretionary Pool Size/Nondiscretionary Pool 
Allocation Contract (NDS-NDA) 

In this contract, the employer has no discretion in setting either the size of the com- 
pensation pool or its allocation among employees; in essence, the employer has no role in 
determining employee compensation. The total employee compensation pool is a function 
of output (percentage of total revenue) and employees receive an equal share of the com- 
pensation pool. The equal division of the compensation pool among employees provides 


Š The characterization of the employer's private information as a noisy signal of individual employee performance 
reflects the idea that individual employee output typically is not easily identified (Prendergast 1999, 11). How- 
ever, employers often receive information related to aspects of individual employee performance. This infor- 
mation typically is private and based on subjective assessments. If the signals were public rather than private, 
then an employer could explicitly tie an employee's compensation to his individual performance signal. 
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no formal controls for employee opportunism; however, the NDS-NDA contract does mit- 
igate employer opportunism by taking away the employer's discretion. Note that the NDS- 
NDA contract is more extreme in this respect than the optimal B-R contract. The employer 
has no discretion in the NDS-NDA contract, while the employer in the B-R contract retains 
discretion over the parameters of the compensation pool but commits ex ante to these 
parameters using jointly observable information. Thus, the NDS-NDA contract focuses on 
eliminating employer opportunism, as opposed to creating an optimal contract from the 
employer's viewpoint, as in B-R.? 


Mitigating Both Employee and Employer Opportunism—Nendiscretionary Pool Size/ 
Discretionary Pool Allocation Contract (NDS-DA) 

In the NDS-DA contract, the compensation pool is a predetermined function of group 
output, but the employer has discretion in allocating the pool among employees. Employer 
opportunism is mitigated because the employer has no discretion over the size of the com- 
pensation pool. Moreover, the employer can reduce employee opportunism by using her 
private signals about individual performance to align an employee’s compensation with his 
performance. Thus, the NDS-DA contract has potential to mitigate both employer and 
employee opportunism. 


Hypotheses 

The prior discussion indicates that use of a nondiscretionary approach to determining 
the size of the employee compensation pool can mitigate employer opportunism. This in- 
creases employees’ willingness to devote time to work, which, in turn, increases output. 
Similarly, the use of a discretionary approach to allocating the employee compensation pool 
should mitigate employee shirking and increase output. Employees will devote more time 
to work because they expect that their employer will exercise her discretion and use infor- 
mation about their individual production to determine their share of total employee com- 
pensation. Ceteris paribus, both employees and employers should benefit from the increased 
output. These arguments lead to the following hypotheses. 


Employees’ Group Output (Performance)? 


Hla: Group output will be greater when the employer does not have discretion over 
the size of the employee compensation pool than when the employer has discre- 
tion over the size of the employee compensation pool. 


Hib: Group output will be greater when the employer has discretion over the allocation 
of the employee compensation pool than when the employer does not have dis- 
cretion over the allocation of the employee compensation pool. 


? There are several other differences between B-R and our paper. First, the B-R model is a single-period model 
while our setting includes a finitely repeated setting. Second, in the B-R model, outcomes and signals are 
dichotomous, while in our setting production outcomes and signals can take on numerous levels. Finally, in the 
B-R model, there are jointly observable signals about each employee's individual performance, whereas in our 
setting all individual signals are the employer's private information and only group output is jointly observable. 
Thus, while B-R has implications for our setting, our paper is not a direct test of their analytical predictions. 

* Predictions similar to Hla and H1b could be made for employees’ allocation of time to work. Given our 
assumption that output is a function of work time, we provide hypotheses only for output (performance). We 
report empirical results for employees' work time in footnote 13. 
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Employees' Compensation 


H2a: Employees' compensation will be greater when the employer does not have dis- 
cretion over the size of the employee compensation pool than when the employer 
has discretion over the size of the employee compensation pool. 


H2b: Employees' compensation will be greater when the employer has discretion over 
the allocation of the employee compensation pool than when the employer does 
not have discretion over the allocation of the employee compensation pool. 


Employer's Residual Profit 


H3a: The employer's residual profit will be greater when the employer does not have 
discretion over the size of the employee compensation pool than when the em- 
ployer has discretion over the size of the employee compensation pool. 


H3b: The employer's residual profit will be greater when the employer has discretion 
over the allocation of the employee compensation pool than when the employer 
does not have discretion over the allocation of the employee compensation pool. 


Notwithstanding these hypotheses, there are at least two behavioral forces working 
against the predictions. First, as noted earlier, workplace norms related to cooperation, trust, 
and reciprocity may naturally alleviate employer and employee opportunism, leaving little 
role for compensation contracts to mitigate such opportunism. Second, with respect to H1b, 
H2b, and H3b, the effectiveness of discretionary allocation to mitigate employee opportun- 
ism depends on whether an employer uses her private information in a manner that moti- 
vates employee effort (B-R). While ideally the emplover will align an employee's compen- 
sation with his individual performance, research suggests that employers are biased in 
performance evaluation (Prendergast 1999). For example, employers may be unwilling to 
make significant distinctions between employees' contributions and compress the individual 
signals, leading to a centrality bias. Alternatively, employers may make more extreme dis- 
tinctions between employees than is evident in their performance signals, resulting in a 
halo effect for higher-performing employees. 


IIl. METHOD 

Participants and Design 

Two hundred thirty-seven undergraduate students participated in our computer-based 
laboratory experiment, which employed a 2 (pool size discretion) x 2 (pool allocation 
discretion) X 8 (period) mixed design. We manipulated pool size discretion and pool 
allocation discretion between-subjects and periods (work sessions) within-subjects. We ran- 
domly assigned participants to one of the four compensation contract conditions: discre- 
tionary pool size/nondiscretionary pool allocation, discretionary pool size/discretionaty 
pool allocation, nondiscretionary pool size/nondiscretionary pool allocation, and nondis- 
cretionary pool size/discretionary pool allocation. 


? Two hundred forty-three students participated in the experiment; we eliminated two groups (six participants) 
due to computer-related malfunctions that resulted in missing data for some members of these groups. 
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Procedure 

One-third (two-thirds) of the participants assumed the role of employer (employee) in 
the experimental task. For each of the four compensation contracts, the administrator ran- 
domly matched one employer with two anonymous employees. The first few computer 
screens explained the production task, which involved decoding numbers into letters (Chow 
1983). Participants then received a description of one of the four compensation schemes 
tailored to their experimental.condition, along with several numerical examples. Each em- 
ployee's compensation contract took the following generic form in each period: 


TP, = |a X (T, — Wj)] + B, (1) 
where: 


TP, — total pay for employee i, i — 1, 2; 
a — pay per unit of leisure time consumed; 
T, — total time available for employee i; 

W, — time allocated to work by employee i; and 
B, = work compensation for employee i. 


The employer's residual profit in each period was computed according to the following 
generic formula: 


RP = [r X (O, + O,)] - (B, + Bj) (2) 
where: 


RP = residual profit received by the employer; 
r = revenue per unit of output; 
O, + O, = output of the two employees (group output); and 
B, + B, = total work compensation for the two employees. 


In each of the four contracts, T, = 180 seconds, a = 0.5 francs, and r = 3.54 francs 
(one franc, our experimental currency, equaled one cent). For the two nondiscretionary pool 
size contracts (NDS-NDA and NDS-DA), the total employee compensation piece rate was 
2.36 francs per unit of group output. 

Our choice of parameter values was driven by the determination of a piece rate that 
provides incentives for employees to shirk in the NDS-NDA contract. For employees to 
want to shirk, the leisure pay per second must exceed the compensation an employee would 
receive for devoting an additional second to work. We fixed leisure pay at a — 0.5 francs 
per second. We then determined a piece rate for each employee of 1.18 francs per unit 
based on: (1) pilot-testing in which participants decoded 58 numbers on average in three 
minutes (approximately 0.32 numbers per second) and (2) a marginal per capita return 
(MPCR) of 0.75 based on a common value used in public goods games (Ledyard 1995). 
Thus, an average participant in the NDS-NDA contract could earn either 0.5 francs from 
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allocating one second to leisure or 0.38 (1.18 X 0.32) francs from allocating one second 
to work, leading to an incentive to shirk.!? 
The contract description varied for each condition as follows. 


e In the DS-NDA compensation contract, B, was described as the total compensation 
pool determined by the employer divided by two (.e., split equally between the two 
employees). 

e The DS-DA compensation contract was described by Equations (1) and (2), with a 
note that the employer had discretion to determine B, for each employee. 

e [n the NDS-NDA compensation contract, B, was described as 1.18 X (O, + O>), 
where 1.18 francs equaled the nondiscretionary amount of individual employee com- 
pensation per unit of group output. This amount represented an equal division of 
revenue among the three firm members (3.54/3). Thus, the total employee compen- 
sation pool for the two employees was computed as 2.36 per unit of group output. 

e [n the NDS-DA compensation contract, B, was described as (d) X (2.36) x (O, 
+ O) where d, equaled the fraction of the total compensation pool the employer 
decided to give to employee i (d, + d, = 1) and 2.36 francs equaled the nondiscre- 
tionary amount of total employee compensation per unit of output. 


The computer screens then described the task. For each of eight work sessions, each 
employee allocated 180 seconds between work and leisure. After every work session, 
each employee learned the number of items he decoded correctly. The employer received 
a private signal for each employee, termed reported output. Reported output equaled the 
actual number of items correctly decoded by the employee adjusted by a randomly chosen 
number between —7 and +7 using a uniform distribution, and reflects the typical situation 
in which employers receive noisy measures of individual employee's performance.!! The 
employer and employees learned total reported group output (i.e., the sum of reported output 
for the two employees); however, only the employer knew reported output for each indi- 
vidual employee. 

All participants completed three three-minute training sessions to ensure they under- 
stood the task. Additionally, prior to starting the experimental sessions, participants an- 
swered several manipulation check questions to ensure that they understood the incentive 
contracts, the information received by employees and employers, and the decisions to be 
made by employees and employers. If a participant answered a question incorrectly, then 
the computer returned to the screen containing the appropriate information. Participants had 
to answer all questions correctly before proceeding to the work sessions. 

After each work period, the computer automatically determined the pool size and/or 
pool allocation in the nondiscretionary conditions. The employer determined the pool size 
and/or pool allocation in the discretionary conditions. Employees’ compensation and the 
employer’s residual profit were based on reported, rather than actual, output because par- 
ticipants learned total reported, but not total actual, output. After completing all eight pe- 
riods, participants completed an exit questionnaire and received their compensation. 


0 As an example, if both employees fully consume leisure, each would earn 90 francs (180 seconds x 0.5 francs) 
in every work session. If two average employees allocate all time to work in a session, each would earn 137 
francs (58 units x 2 persons x 1.18 francs/unit). However, if one employee engages in free riding, he could 
earn 9) francs from leisure plus 68 francs if the other employee works for the entire period (58 units x 1.18 
francs per unit), for a total of 158 francs. Thus, free riding is the dominant strategy when total employee 
compensation is split evenly between the two employees, although output is maximized when both employees 

rk. 


wo 
U Each employee's reported output can be viewed as a function of the number of items he decoded correctly and 
a random state of nature. Reported output was constrained to be greater than or equal to zero. 
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Dependent Measures 


We examine three dependent measures to test our hypotheses: employees’ group per- 
formance (number of items actually decoded correctly), employees' group compensation, 
and employer's residual profit. We use actual, rather than reported, performance as a de- 
pendent measure because it reflects employee output without the effect of the random error. 
For employees' group compensation, we combine the corresponding amounts for the two 
employees paired with one employer (described above in Equation (1)) to reflect the group's 
total compensation. The employer's residual profit was calculated as shown in Equation 
(2). 


IV. RESULTS 
Presentation of Experimental Results 


We present our results in three tables, one for each dependent measure: employees’ 
group performance (actual output), employees’ group compensation, and employer’s resid- 
ual profit.'* Panel A of each table presents means over the eight periods for each dependent 
measure by pool size discretion and pool allocation discretion condition. Panel B of each 
table contains the related ANOVA, with pool size discretion and pool allocation discretion 
as the between-subjects factors and period as the within-subjects (repeated measures) factor. 
Additionally, Figure 1 presents graphical results for all eight periods for the three dependent 
measures. 


Employees’ Performance (Hypotheses la and 1b) 


Table 1 and Panel A of Figure 1 present results for employees’ actual group perform- 
ance (output). Hypothesis la predicted that employee group performance would be greater 
in the nondiscretionary pool size conditions than in the discretionary pool size conditions. 
Hypothesis 1b predicted that employee group performance would be greater under dis- 
cretionary than nondiscretionary pool allocation. The results in Table 1 support these 
predictions. 

Panel A of Table 1 indicates that the mean performance over all eight periods for the 
nondiscretionary pool size conditions was 111.08, compared to mean performance of 95.02 
for discretionary pool size conditions (F = 9.90, p < 0.01, as shown in Panel B), supporting 
Hla. As predicted by H1b, the mean performance of 110.43 for the discretionary pool 
allocation conditions was higher than 95.28 for the nondiscretionary pool allocation con- 
ditions (F = 8.64, p < 0.01). 

The ANOVA in Panel B of Table 1 shows a statistically significant main effect for 
period (F = 10.67, p < 0.01), a statistically significant period-by-pool size discretion in- 
teraction (F = 8.04, p < 0.01), and a statistically significant period-by-pool allocation 
discretion interaction (F = 2.61, p < 0.02). As indicated in Panel A of Figure 1, perform- 
ance in the first period was roughly equal across the four contracts (between 103 and 112 
units); however, by the final period, there was significant variation in performance (a low 
of 52 for the DS-NDA contract and a high of 117 for the NDS-DA contract). Thus, in the 


12 Table 1 reports actual output, while Table 2 reports employee group compensation that is based on reported 
output (actual output adjusted by a randomly selected number between —7 and +7). For this reason, group 
compensation for the nondiscretionary pool size conditions (NDS-NDA and NDS-DA) in Table 2 does not equal 
output in Table 1 multiplied by 2.36 francs. 
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FIGURE 1 
Effect of Compensation Contracts on Employee Group Performance, Employee Group 
Compensation, and Employer Residual Profit by Period* 


Panel A: Average Employee Group Performance by Period” 





(continued on next page) 
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FIGURE 1 (continued) 


Panel C: Average Employer Residual Profit by Period? 
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* The four contracts are discretionary size/nondiscretionary allocation (DS-NDA), discretionary size / 
discretionary allocation (DS-DA), nondiscretionary size/nondiscretionary allocation (NDS-NDA), and 
nondiscretionary size/discretionary allocation (NDS-DA). 

P Employee group performance equals the sum across the two employees in each employer-employee group of 
the actual number of items correctly decoded by the employee. 

° Employee group compensation equals the sum of the compensation received by the two employees in each 
employer-employee group. 

? Employer's residual profit equals total revenue less compensation paid to the two employees in each employer- 
employee group. 


absence of explicit incentives, the effect of employee and employer opportunism had a 
greater impact on performance as the employment relationship approached its end.’ 


Employees’ Compensation (Hypotheses 2a and 2b) 

Table 2 and Panel B of Figure 1 present results for employees' group work compen- 
sation (the amount of firm revenue received by employees).!^ Hypothesis 2a predicted that 
employee compensation would be greater in the nondiscretionary than discretionary pool 


13 An ANOVA for reported group performance showed identical inferences to those reported in Table 1 for actual 
group performance. Additionally, an ANOVA for employee work time indicated statistically significant main 
effects for pool size discretion, pool allocation discretion, period, the period-by-pool size discretion interaction, 
and the period-by-pool allocation discretion interaction. Similar to results for performance, opportunism in 
employees' time allocations became more apparent in the later periods. In the first period, employees allocated 
about the same percentage of time to work in the worst (DS-NDA) contract and best (NDS-DA) contract (90 
percent and 94 percent, respectively). By the final period, however, employees in the DS-NDA contract allocated 
only 41 percent of their time to work, compared to 90 percent for che NDS-DA contract. 

^ We also performed an ANOVA with the dependent variable defined as employees’ work compensation plus 
leisure pay. This ANOVA -more completely reflects employees’ total benefits, i.e., the benefits that employees 
receive from both their work effort and their opportunism (time allocated to leisure). Inferences from ANOVA 
results are the same as those reported in Panel B of Table 2 for work compensation. 
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TABLE 1 
Employee Group Performance (Hla and Hib) 


Panel A: Means (Standard Deviations) for Employee Group Performance over All Periods" 








Compensation Compensation Pool Size” 
Pool 
Allocation* Nondiscretionary Discretionary Overall 
Nondiscretionary 105.92 85.18 95.28 
(21.15) (26.20) (25.79) 
n= 19 n = 20 n = 39 
Discretionary 115.99 104.86 110.43 
(14.80) (25.86) (21.55) 
n = 20 n = 20 n = 40 
Overall 111.08 95.02 102.95 
(18.64) (27.56) (24.79) 
n = 39 = 40 n = 79 
Panel B: ANOVA on Employee Group Performance 
Source of Variation SS df MS pf 
Between-Subjects 
Pool Size Condition (SC) 40,102 I 40,102 9.90 < 0.01 
Pool Allocation Condition (AC) 34,982 I 34,982 8.64 « 0.01 
SC x AC 3,645 1 3,645 0.90 « 0.35 
Group(SC X AC) 303,674 75 4,049 
Within-Subjects 
Periods 28,147 7 4,021 10.67 « 0.01 
Period x SC 21,211 7 3,030 8.04 « 0.01 
Period X AC 6,888 7 984 2.61 « 0.02 
Period X SC x AC 2,937 7 420 1.11 « 0.36 
Period X Group(SC x AC) 197,744 525 377 


* Employee groups consist of two employees paired with one employer. Employee actual group performance 

the sum across the two employees of the actual number of items correctly decoded by the employee. 

b We manipulated the determination of the compensation pool size at two levels: nondiscretionarily set at 2.36 
francs (cents) per unit of output or discretionarily set by the employer. In the nondiscretionary case, employees 
in tota] received two-thirds of revenue (3.54 per unit of output) and the employer received one-third of 
revenue. 

* We manipulated the allocation of the compensation pool between the two employees at two levels: a 
nondiscretionary equal allocation (50 percent to each employee) or a discretionary allocation by the employer. 

ån = the number of groups in each treatment condition. 

° The Group(SC x AC) mean square is the appropriate error term for the between-subjects effects. The Period 
X Group(SC X AC) mean square is the appropriate error term for the within-subjects effects. 

f We report a one-tailed p-value for the main effect of pool size and pool allocation, given the directional 
predictions for these effects. All other p-values are two-tailed. 

s We manipulated Periods as a within-subjects factors with eight levels, 1—8. Participants in all contract 
conditions completed eight work sessions. 


size conditions, while H2b predicted that employee compensation would be greater in the 
discretionary than the nondiscretionary pool allocation condition. The results in Table 2 
support these predictions. 

Specifically, Panel A of Table 2 indicates that the mean employee group compensation 
over all eight periods was 261.23 for the nondiscretionary pool size and 198.60 for the 
discretionary pool size (F = 16.07, p < 0.01 as shown in Panel B). Panel A also shows 
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Panel A: Means (Standard Deviations) for Employee Group Compensation over All Periods” 





Compensation Compensation Pool Size” 
Pool 
Allocation* Nondiscretionary Discretionary Overall 
Nondiscretionary 248.44 171.43 208.94 
(50.46) (81.95) (78.00) 
n = 194 n = 20 n = 39 
Discretionary 273.39 225.17 249.58 
(35.94) (91.71) (72.86) 
n = 20 n — 20 n = 40 
Overall 261.23 198.60 229.52 
(44.85) (90.15) (77.70) 
n = 39 n = 40 n = 79 
Panel B: ANOVA on Employee Group Compensation 
Source of Variation SS df MS F° p! 
Between-Subjects 
Pool Size Condition (SC) 613,244 1 613,244 16.07 « 0.01 
Pool Allocation Condition (AC) 248,253 1 248,253 6.51 « 0.01 
SC x AC 34,106 1 34,106 0.89 « 0.35 
Group(SC x AC) 2,862,249 75 38,163 
Within-Subjects 
Period? 182,438 7 26,063 11.20 < 0.01 
Period x SC 173,781 7 24,826 10.67 < 0.01 
Period x AC 17,064 7 2,438 1.05 < 0.40 
Period x SC x AC 15,398 7 2,200 0.95 < 0.48 
Period x Group(SC x AC) 1,221,853 9325 2,327 


* Employee groups consist of two employees paired with one employer. Employee group compensation equals 
the sum of the compensation received by the two employees. 

» We manipulated the determination of the compensation pool size at two levels: nondiscretionarily set at 2.36 
francs (cents) per unit of output or discretionarily set by the employer. In the nondiscretionary case, employees 
in total received two-thirds of revenue (3.54 per unit of output) and the employer received one-third of 
revenue. 

° We manipulated the allocation of the compensation pool between the two employees at two levels: a 
nondiscretionary equal allocation (50 percent to each employee) or a discretionary allocation by the employer. 

dn = the number of groups in each treatment condition. 

° The Group(SC X AC) mean square is the appropriate error term for the between-subjects effects. The Period 
X Group(SC X AC) mean square is the appropriate error term for the within-subjects effects. 

f We report a one-tailed p-value for the main effect of pool size and pool allocation, given the directional 
predictions for these effects. All other p-values are two-tailed. 

s We manipulated Periods as a within-subjects factors with eight levels, 1-8. Participants in all contract 
conditions completed eight work sessions. | 


that the mean group compensation of 249.58 for the discretionary pool allocation condition 
was greater than that of 208.94 for the nondiscretionary pool allocation condition (F 
= 6.51, p < 0.01). 

Panel B of Table 2 indicates a statistically significant main effect for period (F = 11.20, 
p < 0.01) and a statistically significant period-by-pool size discretion interaction (F 
= 10.67, p < 0.01). These interactions are reflected in trends in compensation over the 
eight periods shown in Panel B of Figure 1. The statistically significant period-by-pool size 
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discretion interaction reflects both decreases in group performance (output) and increases 
in employers' opportunistic behavior in the discretionary conditions over time. On average, 
employers in the discretionary pool size conditions paid total employee compensation of 
59 percent of revenue, compared to the predetermined 67 percent paid in the nondiscre- 
tionary pool size conditions. However, as suggested by the period-by-size discretion inter- 
action, the percentage paid declined over time. For example, in period 1, employers in the 
two discretionary pool size conditions on average allocated 62 percent of total revenue to 
employees, while in period 8, they allocated only 51 percent to employees. 


Employer's Residual Profit (Hypotheses 3a and 3b) 


Hypothesis 3a predicted that the employer's residual profit would be greater in the 
nondiscretionary size conditions than in the discretionary size conditions. Hypothesis 3b 
predicted that the employer's residual profit would be greater in the discretionary allocation 
conditions than in the nondiscretionary allocation conditions. Table 3 and Panel C of Figure 
1 present results for the employer's residual profit; these results do not support H3a but do 
support H3b. 

As shown in Panel A of Table 3, employers in the nondiscretionary pool size conditions 
averaged compensation of 130.65, compared to 139.65 in the discretionary pool size con- 
ditions. Contrary to H3a, the main effect of pool size discretion is not statistically signifi- 
cant, as indicated in Panel B (F — 1.50, p « 0.88). Panel A also indicates that the mean 
employer compensation was 142.25 in the discretionary pool allocation conditions and 
127.98 in the nondiscretionary pool allocation conditions. This difference is statistically 
significant (F — 3.68, p « 0.03), consistent with H3b. 

As indicated in Panel C of Figure 1, there is considerable variation in employers' 
compensation across the four contracts and eight periods. This variability is reflected in the 
statistically significant period-by-pool allocation discretion interaction in Panel B of Table 
3 (F = 2.55, p < 0.02). The results in Panel C of Figure 1 suggest that there is little 
difference across contracts in employer's residual profit until the last two periods, where 
profit appears greater in the conditions in which the employer had allocation discretion. We 
examine this issue further in the supplemental analysis below. 


Supplemental Analyses 


To provide further insights on our results, we performed two additional analyses. First, 
we reran all three ANOVAs using only periods 3—6 to eliminate both learning effects in 
periods 1—2 and end-game effects in periods 7—8. The ANOVAs for group performance 
and employee compensation showed statistically significant main effects for pool size dis- 
cretion and pool allocation discretion, similar to results in Tables 1 and 2; however, there 
were no main or interactive effects related to periods. These analyses indicate that thé effects 
of pool size and allocation discretion documented in Tables 1 and 2 are not simply learning 
or end-game effects. 

The ANOVA for periods 3—6 for employer's residual profit showed no statistically 
significant main or interactive effects. An ANOVA for periods 1—6 had similar results. 
Thus, it appears that the statistically significant effect of pool allocation discretion presented 
in Table 3 arises from results in periods 7—8 and likely represents end-game effects. That 
is, the employer's ability to differentially allocate compensation among employees appears 
to benefit the employer primarily near the end of the employment relation. 

Second, we performed an ANCOVA for our three dependent variables, adding the group 
piece rate from the prior period as a covariate. This piece rate was set at 2.36 francs in the 
two nondiscretionary size conditions, but varied based on the employer's choice in the 
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TABLE 3 
Employer Residual Profit (H3a and H3b) 


Panel A: Means (Standard Deviations) for Employer Residual Profit over All Periods* 








Compensation Compensation Pool Size” 
Pool 
Allocation* Nondiscretionary Discretionary Overall 
Nondiscretionary 124.28 131.49 127.98 
(25.27) | (41. 2 (34.53) 
n= 19 n= n = 39 
Discretionary 136.70 n 142.25 
(17.97) (41.09) (31.80) 
n = 20 n = 20 n = 40 
Overall 130.65 139.65 135.20 
(22.44) (41.77) (33.74) 
n = 39 n = 40 n = 79 
Panel B: ANOVA on Employer Residual Profit 
Source of Variation SS df MS F* p! 
Between-Subjects 
Pool Size Condition (SC) 13,241 1 13,241 1.50 < 0.88 
Pool Allocation Condition (AC) 32,594 1 32,594 3.68 < 0.03 
SC x AC 603 1 603 0.07 « 0.80 
Group(SC X AC) 664,058 75 8,854 
Within-Subjects 
Period? 17,201 7 2,457 1.42 < 0.20 
Period x SC 16,467 7 2,352 1.36 < 0.23 
Period x AC 30,881 7 4,412 2.55 < 0.02 
Period x SC x AC 16,385 7 2,341 1.35 < 0.23 
Period x Group(SC x AC) 908,688 525 1,731 


* Each employer was paired with two employees. An employer's residual profit equals total revenue less 
employee compensation. 

b We manipulated the determination of the compensation pool size at two levels: nondiscretionarily set at 2.36 
francs (cents) per unit of output or discretionarily set by the employer. In the nondiscretionary case, employees 
in total received two-thirds of revenue (3.54 per unit of output) and the employer received one-third of 
revenue. 

* We manipulated the allocation of the compensation pool between the two employees at two levels: a 
nondiscretionary equal allocation (50 percent to each employee) or a discretionary allocation by the employer. 

dn = the number of groups in each treatment condition. 

° The Group(SC X AC) mean square is the appropriate error term for the between-subjects effects. The Period 
X Group(SC x AC) mean square is the appropriate error term for the within-subjects effects. 

f We report a one-tailed p-value for the main effect of pool allocation given the directional predictions for this 
effect. The p-value for the main effect of pool size equals (1 — a/2), given that the results are in the opposite 
direction of that predicted in H3a. All other p-values are two-tailed. 

s We manipulated Periods as a within-subjects factors with eight levels, 1—8. Participants in all contract 
conditions completed eight work sessions. 


discretionary conditions, with an average of 2.09 francs. If employees simply respond to 
the size of the compensation pool in choosing their work efforts and resulting performance, 
then it is possible that differences in piece rates between the discretionary and nondiscre- 
tionary pool size conditions could lead to our results, rather than employer discretion versus 
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nondiscretion, per se. The ANCOVA tests for this possibility by relating employee per- 
formance to the effects of pool size and pool allocation discretion after controlling for any 
effects of differences in group piece rates. 

The prior-period piece rate covariate was statistically significant in all three ANCOVAs. 
However, the inferences for effects of pool size and allocation discretion did not change 
between the ANOVAs in Tables 1—3 and the related ANCOVAs. In the ANCOVAs for 
performance and employee compensation, the main effects for pool size discretion and pool 
allocation discretion were statistically significant at p « 0.01, as in Tables 1 and 2. The 
ANCOVA for employer residual profit showed a nonstatistically significant effect for size 
discretion and a statistically significant allocation discretion effect (p « 0.01), similar to 
results in Table 3. Overall, these findings suggest that our results in Tables 1—3 reflect the 
effects of differences in employer pool size discretion, as opposed to differences in group 
piece rates between the discretionary and nondiscretionary pool size conditions. 


V. SUMMARY AND DISCUSSION 

In this paper, we investigate implications of employer discretion over the total amount 
and allocation of employee compensation for firm output and employer/employee remu- 
. neration. We find that a nondiscretionary approach to determining total employee compen- 
sation and a discretionary approach to allocating employee compensation increase output 
and employee pay. The employer's residual profit increases with discretion over pool al- 
location, but does not increase with a nondiscretionary approach to determining total em- 
ployee compensation. Additionally, our results indicate that employer and employee op- 
portunism do not completely destroy productive outcomes in the absence of mitigating 
controls; however, the negative effects of opportunism increase over time, as the employ- 
ment relationship nears its end. 

Our paper has several implications for the role of compensation-based control systems 
in mitigating potential employer and employee opportunism. First, our paper provides a 
benchmark for the importance of control systems by documenting the effects of opportun- 
ism in the absence of mitigating controls. Our results show ‘that employers and employees 
do not engage in extreme opportunism in the absence of controls, suggesting that workplace 
norms lead employees to work and employers to reward this work to their mutual benefit. 
These findings highlight the need to compare the costs and benefits of compensation sys- 
tems not only with other control systems, but also with naturally occurring social mecha- 
nisms associated with employment settings. 

Second, our results support the use of compensation-based control systems since some 
opportunism exists in their absence. À compensation system based on nondiscretionary 
compensation pool size and discretionary pool allocation increases firm output. Employees 
also receive greater pay from such a compensation system; however, that employers do not 
profit from relinquishing discretion over the size of the compensation pool. This result 
suggests that employers may decide to retain discretion over aspects of compensation to 
the possible detriment of total firm output and employee compensation. However, compe- 
tition among employers for employees could lead to employers relinquishing the right to 
set the compensation pool size. 


13 Some research has found synergies between formal control systems and workplace norms. Specifically, formal 
control systems can enhance the trust that serves as the foundation for cooperative behavior in business settings 
(Coletti et al. 2005). 
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Our paper also complements research on the use of contractible and noncontractible 
information in compensation contracts. Our empirical results support the analytical conclu- 
sions of Baiman and Rajan (1995) regarding the benefits of a contract in which total 
employee compensation is based on contractible information and the allocation of this 
compensation is based in part on noncontractible information. Since much of a firm's 
contractible information emanates from the accounting system, our paper also supports the 
idea that publicly available accounting information and employer's private information have 
complementary roles in performance evaluation and compensation. 

Future research could provide additional insights on issues examined in this paper. 
First, as indicated earlier, we made specific choices for compensation parameter values; 
future research could examine the sensitivity of our results to our compensation parameter 
values. Specifically, research could examine different piece rates and sharing rules among 
employers and employees to determine if our results are robust across different 
parameterizations. 

Second, future research could examine other aspects of our design choices, such as 
anonymous two-employee groups and use of one noisy signal. Employee shirking could 
increase with larger groups of employees, strengthening the need for compensation-based 
control systems.! In contrast, the effectiveness of discretion over pool allocation might be 
reduced when employees are not anonymous if employers compensate employees on per- 
sonal characteristics unrelated to performance (Ittner et al. 2003). Research also could 
investigate the efficacy of discretionary allocation when employers must arrive at subjective 
evaluations of employee performance by aggregating several performance signals. Research 
has documented biases in combining such information (Lipe and Salterio 2000; Prendergast 
and Topel 1993). Future research could examine whether these biases overcome the benefits 
of the NDS-DA contract compared to the NDS-NDA contract. 

Finally, future research could distinguish between effects of nondiscretion versus com- 
mitment in determining the size of the compensation pool. In our experiments, employers 
who did not have discrotion were by debónition cornu: iv a compensation contract. 
However, employers can co:unuüt ex ante to a Sonia: based on contractible information 
and still retain discretion over characteristics of the compensation contract, such as its 
functional form, specific parameters, and length. While we find that employers do not 
benefit by relinquishing discretion over the size of the compensation pool, it is possible 
that employers do benefit from committing to a compensation arrangement over which they 
have discretion to set the terms as suggested by Baiman and Rajan (1995). However, an- 
alytic research assumes that employers know employees' utility functions and, therefore, 
can set contract terms optimally. In practice, the task of understanding ex ante precisely 
what motivates employees may be difficult. Thus, commitment may not benefit an employer 
compared to the situation of retaining full discretion. Future research on these issues will 
increase our understanding of how specific characteristics of employment contracts mitigate 
employee and employer opportunism. 
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ABSTRACT: We analyze how investors and information intermediaries perceive auditor 
tenure. Using earnings response coefficients from returns-earnings regressions as a 
proxy for investor perceptions of earnings quality, we document a positive association 
between Investor perceptions of earnings quality and tenure. Further, we find that the 
influence of reported earnings on stock rankings becomes larger with extended tenure, 
although the association between debt ratings and reported earnings does not vary 
with tenure. Finally, we find that the Influence of past earnings on one-year-ahead 
earnings forecasts becomes greater as tenure Increases. In general, our results are 
consistent with the hypothesis that investors and Information intermediaries perceive 
auditor tenure as improving audit quality. One implication of our study is that Imposing 
mandatory limits on the duration of the auditor-client relationship might impose unin- 
tended costs on capital market participants. 
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I. INTRODUCTION 

he recent rise in accounting irregularities has reopened questions about auditor ten- 

ure, independence, and audit quality (Bricker 2002).' Recent studies provide valuable 

insights into the debate surrounding auditor tenure by examining the association 
between tenure and (1) accounting accruals, (2) analysts' forecast errors, and (3) the cost 
of debt. Myers et al. (2003) conclude that longer auditor tenure constrains managerial 
discretion with accounting accruals, which suggests high audit quality. Johnson et al. (2002) 
also find that accruals are larger and less persistent for firms with short auditor tenure 
relative to those with medium or long tenure. Using credit spreads between bond yields 
and matched Treasury yields as the cost of debt, Mansi et al. (2004) find that the cost of 
debt declines with longer tenure, which suggests bondholders perceive audit quality as 
improving with extended tenure. In contrast, Davis et al. (2002) conclude that audit quality 
declines with extended tenure because, as tenure increases, client firms have greater re- 
porting flexibility and earnings forecast errors decline. 

This study focuses on how investors and information intermediaries perceive auditor 
tenure. Using earnings response coefficients from returns-earnings regressions as a proxy 
for investor perceptions of earnings quality, we analyze whether investors perceive earnings 
quality as being affected by tenure. Given the importance of information intermediaries 
who receive and process financial information for investors, we also analyze whether in- 
dependent rating agencies and financial analysts incorporate the potential effects of tenure 
on earnings quality. Specifically, we examine whetber tenure affects the relationship be- 
tween reported earnings and (1) stock rankings, (2) debt ratings, and (3) analysts' earnings 
forecasts. 

A key distinction between our study and those of Davis et al. (2002) and Mansi et al. 
(2004) is the difference in the research design. We examine whether the extent to which 
analysts rely on past reported earnings to predict future earnings varies with tenure, whereas 
Davis et al. (2002) explore the association between forecast errors and tenure. Our research 
design might be better suited because it is difficult to draw unambiguous inferences about 
analysts' perceptions from an association between forecast errors and tenure. Lower forecast 
errors with longer tenure might suggest that earnings quality is perceived as improving with 
tenure because earnings are more predictable. However, it could also suggest lower earnings 
quality if managers increasingly guide earnings forecasts as auditor tenure lengthens. Fur- 
ther, Mansi et al. (2004) investigate the influence of tenure on the cost of debt, whereas 
we examine whether the influence of reported earnings on debt ratings varies with tenure. 
Although Mansi et al. (2004) examine the effect of tenure on debt ratings, their primary 
aim is to purge (orthogonalize) the information effects of tenure and other control variables 
related to debt ratings and ultimately examine the insurance role of tenure on the cost of 
debt. In contrast, our emphasis is on the informational role of tenure. Our fundamental 
objective is to provide insights into any changes in the perceived credibility of reported 
earnings with extended auditor tenure. 

Emphasis on capital market perceptions of independence and audit quality is consistent 
with the Financial Accounting Standards Board's (FASB) conceptual framework for finan- 
cial reporting and principles of auditor independence (Carmichael 1999). According to the 


! Regulators express concerns that pressure to retain client firms and the “comfort level” created between auditors 
and management over time impair auditor independence, which adversely affects audit quality (U.S. Government 
Accounting Office [GAO] 2003). The accounting profession, on the other hand, claims that the likelihood of 
audit failures is greater during the initial period of an auditor-client relationship because of lack of information 
about client-specific risks (PricewaterhouseCoopers 2002). 
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Statement of Financial Accounting Concepts (SFAC No. 1, FASB 1978), “financial state- 
ments are often audited by Independent accountants for the purpose of enhancing confidence 
in their reliability.” The AICPA (1994) also acknowledges the importance of considering 
investor perceptions of auditor independence. A former chairman of AICPA, Elliott (2000) 
says "[The AICPA] believe[s] that appearances are very important and capital markets 
require confidence in financial statements and audit reports, and the member firms of the 
AICPA are basing their business of auditing on their reputations, and that is heavily affected 
by appearance." 

Other things remaining constant, audited financial statements are less reliable (or of 
lower perceived quality) for investment and credit decisions if users of financial statements 
view lengthy tenure as having an adverse effect on auditor independence and audit quality. 
Alternatively, investors and information intermediaries are more likely to rely on reported 
accounting numbers if they perceive that greater auditor expertise from longer tenure im- 
proves independence and audit quality. We assert that the relationship between perceptions 
of earnings quality and auditor tenure provides insights into how capital market participants 
view auditor tenure as affecting audit quality. 

We use earnings response coefficients (ERCs) from contemporaneous returns-earnings 
regressions to measure investor perceptions of earnings quality. After controlling for the 
other determinants of ERC— such as the age of the firm, the quality of auditors, growth, 
earnings persistence, earnings volatility, systematic risk, firm size, financial leverage, and 
regulatory environment—we find that the magnitude of the ERC increases as the auditor- 
client relationship lengthens. One concern is that if the market anticipates a portion of 
current earnings more than one-year-ahead of the earnings release, then the estimated ERC 
might be biased downward (Kothari 1992; Kothari and Sloan 1992). Additionally, if the 
market is more likely to anticipate current earnings for firms with longer tenure, then the 
bias might be associated with tenure. Consistent with the premise that prices lead earnings, 
the estimated ERC for firms with extended tenure is larger when we increase the returns 
measurement window or when we use non-market metrics, such as total assets, to deflate 
earnings. Thus, the ERC-based findings are consistent with the hypothesis that investors 
perceive auditor tenure as enhancing earnings quality. 

Further, controlling for the other determinants of stock rankings, we find that the influ- 
ence of reported earnings on Standard & Poor’s (S&P) common stock rankings becomes 
larger with extended tenure. In contrast, the association between S&P debt ratings and . 
reported earnings does not vary with tenure. Thus, our results provide modest evidence that 
independent rating agencies perceive reported earnings as being more reliable for firms with 
longer tenure. | 

Finally, after controlling for factors that affect analysts' earnings forecasts, we find that 
the influence of past earnings on one-year-ahead earnings forecasts becomes larger over 
extended auditor-client relationships. All else equal, analysts are more likely to rely on 
reported earnings to predict future earnings with longer tenure. Thus, the results from 
analyst earnings forecasts also suggest that analysts perceive earnings quality as improving 
with longer auditor tenure. 

While our results are based on a large sample of firms spanning 11 years, there is one 
concern with our dataset. If auditor tenure is endogenous to audit quality, then the results 
are also consistent with an alternative hypothesis that auditor turnover is high for firms 
with low earnings quality. In other words, high-quality auditors might terminate engage- 
ments with client firms that prefer low-quality financial statements (DeFond and 
Subramanyam 1998). We address potential concerns about frequent auditor turnover by 
constructing a subsample for which the auditor-client relationship lasts for at least five 
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years, as in. Myers et al. (2003). When we use this subsample, our inferences remain 
unchanged for all our tests. 

In sum, our results are generally consistent with the hypothesis that reported earnings 
are perceived as being more reliable as auditor tenure increases. One implication of our 
results is that capital market participants view longer auditor tenure as having a favorable 
impact on audit quality. Our results suggest that imposing mandatory limits on the duration 
of the auditor-client relationship might impose unintended costs on capital markets. How- 
ever, results from a regime without auditor term limits may not be applicable to a regulated 
environment because of differences in economic incentives for both auditors and clients. 

The rest of the paper is organized as follows. Section II establishes the links between 
tenure, independence, and perceptions of audit quality. Section IIT discusses the research 
design, and Section IV describes the sample selection procedure. Section V reports the 
results of the association between tenure and perceptions of earnings quality. Section VI 
concludes the paper. 


II. TENURE, INDEPENDENCE, AND PERCEPTIONS OF AUDIT QUALITY 

Independent auditors are considered the "gatekeepers" of the public securities markets 
(SEC 2001, IIL A). However, the recent rash of accounting irregularities has led many to 
question auditor independence (Wall Street Journal 2002a, 2002b). One perception is that 
auditors are more likely to agree with managers on important reporting decisions as the 
length of the audit engagement increases (Ryan et al. 2001; Farmer et al. 1987). Therefore, 
imposing mandatory limits on auditor tenure is expected to improve audit quality by re- 
ducing client firms' influence over auditors (Turner 2002; Brody and Moscove 1998; SEC 
1994; AICPA 1978; U.S. Senate 1977; Mautz and Sharaf 1961). 

An opposing viewpoint is that problem audits occur more frequently for newer clients 
because auditors have less information about these firms (AICPA 1992). Client-specific 
knowledge of items such as operations, accounting system, and internal control structure 
is crucial for auditors to detect material errors and misstatements. In particular, Johnson et 
al. (2002) argue that lack of adequate client-specific knowledge during the early years of 
engagement decreases the likelihood of detecting material errors and misstatements. As the 
auditor-client relationship lengthens, firm-specific expertise allows auditors to rely less on 
managerial estimates and become more independent of management (Solomon et al. 1999). 

The crux of the debate rests on how tenure affects auditor independence. Proponents 
of mandatory auditor rotation claim that lengthy auditor tenure erodes independence, which 
in turn impairs audit quality. Others argue that independence and audit quality increase 
with longer tenure because of improved auditor expertise from superior client-specific 
knowledge.” Since independence is not observable, regulators, practitioners, and academics 
often rely on the appearance dimension to define auditor independence (Dopuch et al. 2003; 
Kinney 1999). 

Our study investigates whether capital market participants perceive longer tenure as 
affecting audit quality. Insights from a market-based approach are important along at least 
two dimensions. First, academics often emphasize the need to understand capital market 
perceptions of auditor independence and audit quality because ultimately the value of au- 
diting services depends on perceptions of independence (Dopuch et al. 2003; Shockley 


2 Expert knowledge gained through years of on-the-job experience increases the likelihood that auditors will detect 
errors in financial statements (Ashton 1991; Libby and Frederick 1990). In contrast, industry specialization, 
another frequently used proxy for auditor expertise, is based on training and practical experience gained from 
auditing in a particular industry (Gramling and Stone 2001; Solomon et al. 1999; Hogan and Jeter 1999; Craswell 
et al. 1995). 
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1981). Second, regulatory institutions such as the FASB (SFAC No. 1) and SEC (2000) 
emphasize the importance of capital market perceptions of auditor independence. À per- 
ception that the auditors' work is more objective and independent inspires greater confi- 
dence in auditor opinion, which increases the perceived reliability or quality of reported 
accounting numbers (Ryan et al. 2001; Elliott and Jacobson 1998). Independent auditors 
increase the reliability of financial statements because (1) they are more likely to prevent 
or detect and correct material misstatements/omissions, and (2) they ensure that financial 
statements comply with generally accepted accounting principles (Carmichael 1999). There- 
fore, to the extent that capital market participants view tenure as improving independence 
and audit quality, financial statements are perceived as more reliable for financial decisions 
as tenure lengthens. 

In this study, we focus on the perceptions of investors because they are the principal 
users of financial statements. In its Conceptual Framework, the FASB defines "quality" in 
relation to the usefulness of financial statements to investors and links "usefulness" in turn 
to constructs such as relevance and reliability (SFACs No. 1 [FASB 1978] and No. 2 [FASB 
1980]). To draw inferences about investors' perceptions of earnings quality, researchers tend 
to use stock-market-based metrics such as earnings response coefficients from regressions 
of returns on earnings (Schipper and Vincent 2003; Warfield et al. 1995). Prior studies 
document that investors pay a larger premium for "high-quality" earnings because high- 
quality earnings are viewed as sustainable (Schipper and Vincent 2003; Teoh and Wong 
1993). Thus, examining the influence of auditor tenure on the pricing of earnings is likely 
to provide valuable insights into investors’ views of the association between the length of 
the auditor-client relationship and earnings quality. 

Given that information intermediaries constitute an integral part of the capital market 
by providing stock recommendations, debt ratings, and earnings forecasts (Lang and 
Lundholm 1996), we also analyze how independent rating agencies and financial analysts 
view auditor tenure. Hunt (2002) states that independent rating agencies provide information 
about the creditworthiness of issuers and that credit ratings play a significant role in in- 
vestment decisions. Extant research finds links between earnings and debt ratings/stock 
rankings issued by independent rating agencies (Bhojraj and Sengupta 2003; Ziebart and 
Reiter 1992; Van Horne 1992; Kaplan and Urwitz 1979), suggesting that perceptions of 
earnings quality could be an important input in determining rankings/ratings. Similarly, 
financial analysts also play a prominent role as information intermediaries in capital markets 
because of their ability to incorporate value-relevant information in their published reports, 
which impacts security prices (Francis and Soffer 1997; Schipper 1991; Lys and Sohn 1990; 
Brown et al. 1987). Prior research finds that analysts rely on earnings releases to estimate 
future earnings (Kasznik and McNichols 2002; Barron et al. 2002; Stickel 1989), which 
suggests that the extent to which analysts depend on reported earnings to make earnings 
forecasts might vary with perceptions of earnings quality. 

Extending this line of research, we examine the association between auditor tenure and 
(1) stock rankings, (2) debt ratings, and (3) analysts' forecasts of earnings per share. Ad- 
ditionally, we analyze how auditor tenure affects the association between reported earnings 
and rankings, ratings, and earnings forecasts. If auditor tenure is perceived as enhancing 
earnings quality, then, all else equal, the influence of reported earnings on rankings/ratings 
and earnings forecasts is expected to become larger with longer auditor tenure because 
reported earnings are viewed as more informative about future earnings. The converse is 
true if information intermediaries perceive longer auditor tenure as eroding earnings quality. 

Based on the association between our proxy measures for perceptions of earnings qual- 
ity and auditor tenure, we infer how investors and information intermediaries view auditor 
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tenure as affecting audit quality. A positive relationship between the proxies for perceptions 
of earnings quality and auditor tenure is consistent with the hypothesis that longer tenure 
is perceived as improving independence and audit quality. In contrast, a negative relation- 
ship is consistent with the hypothesis that lengthy tenure is viewed as eroding independence 
and audit quality. 


IIl. RESEARCH DESIGN 
We use the following basic regression framework to analyze whether investors, inde- 
pendent rating agencies, and financial analysts perceive earnings quality as being affected 
by auditor tenure: 


Dependent variable = a + B E + BAE + B,E*Tenure + B,AE*Tenure 


9 
+ BsTenure + > Bs.2u- 5 E* Control variable, 
jul 
9 
+ > Bz..24-:; AE*Control variable, 
J=1 
9 
+ > B5,, Control variable, + €. (1) 
=I 


E and AE are reported earnings and changes in reported earnings, respectively. Tenure is 
the duration of the auditor-client relationship (calculated using Compustat #149) in years, 
starting from 1982 (audit firm mergers are considered as a continuation of the prior auditor). 
Subscript j represents the number of control variables (1 to 9). Each of the control variables 
is interacted with E/AE and is also included as a separate independent variable. Details on 
the dependent and control variables are provided in subsequent subsections. 

The sum of earnings levels and changes coefficients (B, + B5) or the “earnings response 
coefficient" (ERC) is our proxy for capital markets' perceptions of earnings quality. Our 
interest is in the sum of the E*Tenure and AE*Tenure coefficients (B, + B4). If investors, 
rating agencies, and analysts perceive earnings quality as improving (declining) with longer 
auditor tenure, B, + B, is expected to differ from zero. 


Perceptions of Investors and Auditor Tenure 

We measure investor perceptions of earnings quality using 12-month (ending three 
months after the fiscal year-end) cumulative market-adjusted returns (CAR) as the dependent 
variable in Equation (1). Market-adjusted returns are the difference between raw returns 
and value-weighted CRSP market returns. 

We include earnings changes and earnings levels in the same regression because in- 
cluding both increases the explanatory power and magnitude of earnings response coeffi- 
cients when earnings contain both transitory and permanent components (Easton and Harris 
1991; Ali and Zarowin 1992). E is income before extraordinary items (Compustat #18) and 
AE is the difference between income before extraordinary items for the current year and 
that of last year. Both E and AE are deflated by market value of equity (Compustat #25 
X #199) at the beginning of the year. 

We include various control variables because the ERC is associated with other firm 
characteristics (Warfield et al. 1995; Dhaliwal and Reynolds 1994; Collins and Kothari 
1989), which, in turn, might be associated with tenure. The control variables are defined 
as follows: FirmAge, computed using the beginning and end dates as reported in CRSP, 
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measures the number of years that the firm has been publicly traded as of the fiscal year- 
end; Big4 is an indicator variable that equals 1 when the client's auditor is a large account- 
ing firm (Compustat #149); Growth is the sum of the market value of equity (Compustat 
#25 X #199) and the book value of debt (Compustat #9 + #34) scaled by the book value 
of total assets (Compustat #6); Persistence (Volatility) is the first-order autocorrelation (stan- 
dard deviation) of income before extraordinary items per share (Compustat Quarterly #8/ 
Quarterly #61) for the past 16 quarters;? Beta is systematic risk computed using the past 
60 monthly stock returns; Size is the logarithmic transformation of the fiscal year-end 
market value of equity (Compustat #25 x #199) of the prior year; Leverage is the ratio of 
total debt (Compustat #9 + #34) to total assets (Compustat #6); and Regulation is an 
indicator variable that equals 1 for firms in a regulated industry with two-digit standard 
industry classification codes between 40 and 49 or between 60 and 63. 

We include FirmAge for two reasons: (1) older firms are more likely to be stable with 
less information asymmetry problems, which suggests higher ERCs; and (2) Tenure and 
FirmAge are positively correlated. We control for Big4 because large auditors are generally 
associated with high-quality audits (Becker et al. 1998; Teoh and Wong 1993). The inclu- 
sion of Growth, Persistence, Volatility, and Beta is primarily motivated by valuation con- 
siderations (Warfield et al. 1995). The inclusion of Size is motivated by the political cost 
theory: managers of large, politically sensitive firms are more likely to exploit the latitude 
in accounting to reduce political costs, which affects earnings quality. We include Leverage 
because of contracting considerations: firms with high leverage are more likely to use the 
latitude in accounting to avoid possible debt-covenant violations (DeFond and Jiambalvo 
1994). Finally, the quality of earnings is affected in a regulated environment (Regulation) 
because managers have limited scope for opportunistic behavior (Warfield et al. 1995). 


Perceptions of Information Intermediaries and Auditor Tenure 
Independent Rating Agencies 

Our analysis of how tenure affects independent rating agencies' perceptions of earnings 
quality is based on estimates from Equation (1) using Standard & Poor's (S&P) common 
stock rankings (Stock Rankings) and senior debt ratings (Debt Ratings) as dependent vari- 
ables. Stock Rankings represent numerical values of 1 to 7, corresponding to S&P common 
stock rankings (Compustat #282). A value of 1 is assigned if a firm's common stock ranking 
is rated as A+ and as S&P common stock rankings decline, the numerical value increases 
by 1.4 Similarly, Debt Ratings are assigned a value of 1 if a firm's S&P senior debt (Com- 
pustat #280) is rated as AAA. As S&P debt ratings decline from AAA (the highest) to D 
(payment default), the numerical value increases by 1. 

We include Growth, Volatility, Beta, Size, and Leverage to control for cross-sectional 
differences in firm quality and riskiness (Bhojraj and Sengupta 2003; Van Horne 1992; 
Ziebart and Reiter 1992; Kaplan and Urwitz 1979). Growing firms with high earnings 
volatility tend to be more risky, Beta controls for both operating and financial risk, large 
firms tend to be less risky, and firms with higher leverage have greater financial risk. We 
also include FirmAge, Big4, Persistence, and Regulation because the association between 
reported earnings and rankings/ratings might depend on the firm's life-cycle, whether firms 


3 Since Persistence and Volatility measures are computed using time-series income per share numbers, we adjust 
for stock-splits and stock dividends that occur subsequent to the end of a given period using the adjustment 
factor reported in Compustat (Quarterly #17). 

^ For instance, values 2 through 7 correspond to the S&P common stock rankings of A, A-, B+, B, B-, and C, 
respectively. We do not include S&P common stock rankings of D (in reorganization) and LIQ (liquidation) for 
our sample because of going-concern issues. 
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use big audit firms, whether earnings are persistent, and whether firms are in a regulated 
environment. | 

In a recent study, Mansi et al. (2004) find that longer tenure lowers the rate of return 
required by bondholders and that the impact is larger for “information-sensitive” securities 
such as non-investment grade bonds (debt ratings less than BBB-—). Therefore, we also 
separately estimate the effects of tenure on investment and non-investment grade debt 
ratings. 


Financial Analysts 

Finally, our analysis of whether tenure influences financial analysts’ perceptions of 
earnings quality is based on the estimates from Equation (1) using earnings forecasts 
(FEPS,) as the dependent variable. FEPS, is the consensus analyst forecasts of annual 
earnings per share from the Institutional Brokers Estimation System (I/B/E/S) database. 
Our proxy for consensus forecasts is the mean one-year-ahead forecast for year t issued 
immediately following the earnings announcement for year t—1 (earnings announcement 
dates are obtained from Compustat)? We restrict the forecasts to those issued immediately 
following earnings announcements, as in Kasznik and McNichols (2002) and Barron et al. 
(2002), because (1) any change in analysts' perceptions of earnings quality is more likely 
to be updated following earnings releases (Barron et al. 2002), (2) it excludes stale forecasts 
Since forecast revisions are more common following earnings announcements (Stickel 
1989), and (3) it conditions the forecasts on the same set of publicly disclosed information. 

Reported earnings per share (EPS, , and AEPS, ,) are used as measures for E and AE 
in Equation (1). EPS, , is annual earnings per share reported for year t—1, and AEPS, , is 
the absolute change in earnings per share for year t—1 defined as the difference in annual 
earnings per share in year t—1 and that in year t—2 (|EPS, , — EPS, JD), both obtained 
from the I/B/E/S database. As in Barron et al. (2002), we include the absolute value of 
changes in earnings because a larger earnings surprise tends to be temporary, which reduces 
the potential usefulness of past earnings in predicting future earnings. 

Since prior studies find that analysts' incentives to acquire information about future 
earnings are affected by firm characteristics (DeFond and Hung 2003; Barron et al. 2002; 
Lang and Lundholm 1996), we include the following control variables: FirmAge because 
older firms are more likely to be stable and consequently earnings might be easier to predict; 
auditor type (Big4) because analysts might perceive earnings quality as improving for firms 
with big auditors; Growth because high-growth firms tend to generate greater demand for 
private information, thereby reducing the reliance on reported earnings (Barron et al. 2002; 
Lang and Lundbolm 1996); Volatility because analysts are more likely to attach lower 
importance on reported earnings for firms with higher earnings volatility (DeFond and Hung 
2003); risk using a market-based measure (Beta) and a balance-sheet-based measure (Lev- 
erage); Size because prior studies find that firm size is associated with risk and the infor- 
mation environment, which affects earnings predictability (DeFond and Hung 2003; Barron 
et al. 2002); Regulation because the predictability of earnings might vary between industries 
(O'Brien 1990); and the number of analysts (Analysts) providing annual earnings per share 


ta 


We prefer consensus forecasts to individual forecasts because prior research finds that consensus forecasts are 
more accurate than the individual forecasts underlying the consensus. One explanation for this result is that the 
forecast errors made by individual analysts are less than perfectly correlated and therefore in the process of 
aggregation, the individual forecast errors offset each other (see Brown et al. [1985] for related papers). The 
results are similar when we use either the mean or the median forecast. 
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forecasts for a firm (Lang and Lundholm 1996). All the control variables including Tenure 
are measured at the end of year t— 1. 

In a recent study, Davis et al. (2002) find that forecast errors decline as tenure in- 
creases. However, it is difficult to draw unambiguous inferences about perceptions of 
analysts from an association between forecast errors and tenure. Lower forecast errors might 
suggest that analysts perceive earnings quality as improving with tenure because earnings 
are more predictable. Alternatively, lower forecast errors with longer tenure might suggest 
lower earnings quality if managers are more likely to guide earnings forecasts as auditor 
tenure lengthens.’ Hence, we do not focus on earnings forecast attributes such as forecast 
errors and dispersion. 


IV. DATA 

We construct a sample from the list of publicly traded firms in the 2001 Compustat 
annual files (active and research). Since the 2001 Compustat files cover 20 years of data, 
and financial data are available from 1982, auditor tenure is 1 for the first year by construc- 
tion. The analysis begins with 1990 to provide some variation for auditor tenure and ex- 
cludes the year 2001 because of possible missing and incomplete data.” Stock return and 
firm age data are obtained from the 2001 CRSP files. We get earnings forecast data from 
the 2001 Institutional Brokers Estimation System (I/B/E/S) Summary Estimates. 

We impose the following restrictions on the sample: (1) we delete the top and bottom 
] percent of observations for the level of earnings (E), changes in earnings (AE), annual 
earnings per share (EPS), and the absolute change in annual earnings per share (AEPS); 
(2) we remove all observations with the absolute value of cumulative market-adjusted re- 
turns (CAR) greater than 100 percent; and (3) we also winsorize the top and bottom 1 
percent of obsérvations for Growth, Persistence, Volatility, Beta, and Leverage.? This sample 
selection procedure results in a maximum of 38,794 observations for the "full" sample 
over the years 1990 through 2000. 

Some concerns remain with the full sample. Firms might frequently switch auditors 
either because of "opinion shopping" (SEC 1988) or because of auditors' preference for 
conservative accounting choices (DeFond and Subramanyam 1998). Alternatively, high- 
quality auditors might end engagements with clients that prefer low quality of financial 


Researchers frequently use forecast errors and dispersion among forecasts as proxies for the uncertainty among 
analysts about the quality of accounting information (Hope 2003; Duru and Reeb 2002; Ashbaugh and Pincus 
2001; Barron et al. 1998; Lang and Lundholm 1996). 

Managers. have strong incentives to avoid negative earnings surprises because of the large stock-price reactions 

to negative earnings (see Ghosh et al. 2005; Bartov et al. 2002; Myers and Skinner 2002; Skinner and Sloan 

2002; Barth et al. 1999). Firms can avoid earnings surprises by manipulating earnings to meet expectations or 

by guiding analyst expectations to avoid overly optimistic forecasts. Although existing studies suggest that 

earnings management declines with longer auditor tenure (Myers et al. 2003; Johnson et al. 2002), we do not 
know whether tenure affects managers' propensity to guide analvsts' expectations. 

* Our computation of Tenure will understate the actual length of auditor tenure because Tenure is 1 for firms in 
1982 even if the auditor was the same for the prior years. However, when we replicate our analysis computing 
Tenure from 1974 using information about the client's auditor as reported in Compustat backfiles, similar to 
Mansi et al. (2004), our conclusions remain unaffected. 

? The use of alternative cut-off points such as 0.5 percent also yields similar results, but it retains extreme 

observations. For instance, the mintmum of E is —541 percent and the maximum of AE is 629 percent when 

we delete the top and bottom 0.5 percent. We delete observations with absolute CARs greater than 100 percent 
to reduce the impact of major corporate restructuring activities such as mergers, bankruptcies, spin-offs, etc. 

However, when we delete the top and bottom 1 percent of CAR, we get similar results. The results on tenure 

are unaffected when we truncate the top and bottom 1 percent of observations for the control variables. 
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reporting. We avoid some of these biases by constructing a subsample in which the auditor- 
client relationship lasts for at least five years. For auditor-client relationships lasting less 
than five years, we delete all of the firm-year observations. The maximum number of 
observations for the "restricted" sample is 35,826 firm-years.!? 

Table 1 reports summary statistics for both the full sample (Panel A) and the restricted 
sample (Panel B). We report the mean, minimum, first quartile, median, third quartile, and 
maximum for Tenure, FirmÁge, CAR, E, AE, Stock Rankings, Debt Ratings, and FEPS. 
Auditor tenure is longer for the restricted sample than for the full sample; the mean/median 











TABLE 1 

Descriptive Statistics 
Variables — — Means Min 25% H% TSh Ma n 
Panel A: Full Sample 
Tenure (years) 8.549 1.000 5.000 8.000 12.000 19.000 38,794 
FirmAge (years) 17.597 3.417 7.000 12.167 233833 75.000 38,794 
CAR —0.060 —0.991 —0.343 -0.108 0.158 0.999 38,794 
E —0.007 —2.795 —0.022 0.048 0.083 0.523 38,794 
AE 0.006 —1.418 —0.032 0.006 0.034 2.815 38,794 
Stock Rankings 5.096 1.000 4.000 5.000 6.000 7.000 25,745 
Debt Ratings 9.147 1.000 6.000 9.000 12.000 23.000 9,462 
FEPS ($) 1.045 —4.140 0.450 0.890 1.490 7.250 16,417 
Panel B: Restricted Sample 
Tenure (years) 9.096 1.000 6.000 9.000 12.000 19.000 35,826 
FirmAge (years) 18.004 3.417 7.000 12.750 | 24.000 75.000 35,826 
CAR —0.054 —0.989 —0.332 —0.102 0.161 0.999 35,826 
E 0.000 —2.195 —0.013 0.050 0.084 0.523 35,826 
AE 0.006 —1.418 —0.030 0.006 0.033 2.804 35,826 
Stock Rankings 5.055 1.000 4.000 5.000 6.000 7.000 24,216 
Debt Ratings 9.087 1.000 6.000 9.000 12.000 23.000 9,060 
FEPS ($) 1.049 —4.140 0.450 0.890 1.500 7.250 15,699 


The full sample includes Compustat firms with available data from 1990 to 2000. The restricted sample consists 
of firms in the full sample with auditor-client relationships lasting for at least five years. Tenure is the duration 
of the auditor-client relationship in years starting from 1982. FirmAge measures the number of years that the 
firm has been publicly traded as of the fiscal year-end. CAR is cumulative market-adjusted returns for the 12- 
month period ending three months after the fiscal year-end. Market-adjusted returns are the difference between 


` raw returns and value-weighted CRSP market returns. E is income before extraordinary items deflated by market 


value of equity at the beginning of the year. AF is the difference between income before extraordinary items for 
the current year and that of last year deflated by market value of equity at the beginning of the year. Stock 
Rankings and Debt Ratings represent numerical values of S&P common stock rankings and S&P senior debt 
ratings, respectively. Stock Rankings (Debt Ratings) are assigned a value of 1 if a firm's S&P common stock 


(senior debt) is rated as A+ (AAA), and as the S&P common stock rankings (debt ratings) decline, the 


numerical value increases by 1. FEPS is the mean annual one-year-ahead earnings per share forecasts for year t 
issued immediately following the earnings announcement for year t—1. 


10 Although we use the Myers et al. (2003) methodology to construct the restricted sample, the percentage of 
observations deleted from the full sample is much smaller in our study. One primary reason for the difference 
is that we include many more variables in our analyses, which reduces the size of our full sample. Thus, fewer 
observations are lost from requiring that the auditor-client relationship lasts for at least five years in our study 
relative to those in Myers et al. (2003). 
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Tenure for the full sample is 8.5/8 years, while the corresponding number for the restricted 
sample is 9.1/9 years. The mean FirmAge for the full sample is 17.6 years, compared to a 
mean of 18.0 years for the restricted sample. FirmAge ranges between 3.4 years and 75 
years for the full and restricted samples. 

The summary statistics for CAR, E, and AE indicate few differences across the two 
samples. The mean/median CAR for the full sample is —0.060/ —0.108, while that for the 
restricted sample is —0.054/—0.102.!! The mean/median E for the full sample is —0.007/ 
0.048, compared to a mean/median of 0. 000/0.050 for the restricted sample. The mean/ 
median AE is identical for the full and restricted samples (0.006). 

Similarly, there are almost no differences in Stock Rankings, Debt Ratings, and earnings 
forecasts (FEPS) between the two samples. The average firm in both samples has a B S&P 
common stock ranking (a numerical score of 5), and a BBB S&P senior debt rating (a 
numerical score of about 9). The mean/median one-year-ahead analysts' earnings per share 
forecasts (FEPS) is $1.05/$0.89 and $1.05/$0.89 for the full and restricted samples, 
respectively. 

Table 2 reports the Pearson correlation (p) matrix between CAR, E, AE, Stock Rankings, 
Debt Ratings, FEPS, Tenure, and FirmAge for the full sample. The high correlation between 
Stock Rankings and Debt Ratings (p — 0.686) suggests that stock rankings and debt ratings 
issued by independent rating agencies contain significant overlapping information. However, 
low correlations across variables that measure investor, rating agency, and financial analyst 
perceptions indicate that analysis of each market participant's perceptions provides distinct 
insights into the overall perceptions of the quality of accounting information. Finally, the 
correlation between Tenure and FirmAge is positive and statistically significant (p — 0.337, 
p-value — 0.001), which underscores the need to control for FirmAge when analyzing 
Tenure. 


V. RESULTS 

All the regression results reported in Tables 3 to 7 are based on yearly regressions from 
1990 to 2000. The reported coefficients are the average of 11 yearly coefficients, and the 
corresponding t-statistics are computed by comparing the average coefficient to its time- 
series standard error. We prefer this approach to a pooled time-series cross-sectional esti- 
mation because the standard errors in the latter approach are not adjusted for the correlation 
of regression residuals across firms, which lead to inflated t-statistics. The estimation pro- 
cedure of year-by-year cross-sectional regressions and the use of average slopes and their 
time-series standard errors to draw inferences allow for residual cross-correlation (for de- 
tails, see Fama and French 2000). 


Perceptions of Investors and Auditor Tenure 


In this subsection, we report the findings of how auditor tenure affects the returns- 
earnings association. Our interest is in the sign and magnitude of the sum of the coefficients 
on E*Tenure and AE*Tenure (B, + B4). Consistent with prior research, we find that reported 
earnings (E and AE) are significantly positively associated with returns (CAR) in Table 3. 
The sum of the coefficients on E and AE, or ERC, is 0.507 (t-statistic — 10.30) in the first 


!! Prior studies also find negative mean CAR: (1) DeFond and Hung (2003) report that the mean CAR measured 
over fifteen months for a sample of 21,908 observations from 1993 to 1999 is -0.05 (—0.04) for firms with 
(without) I/B/E/S cash flow forecasts; (2) Subramanyam and Wild (1996) find that the mean CAR over one 
quarter for a sample of 25,160 observations from 1981 to 1990 is —0.01; and (3) Ali and Zarowin (1992) report 
that the mean 72-month CAR from 1970 to 1985 is —0.05 (—0.04) for the transitory (permanent) group with 
4,355 (7,219) observations. 
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TABLE 2 
Pearson Correlation Matrix 
Variables 1 2 3 4 5 6 7 8 
1. CAR 1.000 
2. E 0.262 1.000 
(0.001) 
3. AE 0.194 0.368 1.000 


(0.001) — (0.001) 

4. Stock Rankings ~-0.118 — —0.266 0.068 1.000 
(0.001) (0.001) (0.001 

5. Debt Ratings —0.185 —0.341  —0.028 0.686 1000 
(0.001) (0.001) (0.004) (0.001) 


6. FEPS —0.002 0.021  —0.003  —0.007  —0.080 . 1.000 
(0.802) (0.001) (0.652) (0.391) (0.001) 
7. Tenure 0.112 0.122 0.015  —0.169  -0.201 0.024 1.000 
(0.001) (0001 (0.001) (0.001) (0.001) (0.001) 
8. FirmAge 0.093 0.115 0010 —0291 —0428 0.025 0337 1.000 


(0001) (0.001) (0.026) (0.001) (0.001) (0.001) (0.001) 


Pearson correlation matrix for the full sample. CAR is cumulative market-adjusted returns for the 12-month 
period ending three months after the fiscal year-end, where market-adjusted returns are the difference between 
raw returns and value-weighted CRSP market returns. E is income before extraordinary items deflated by market 
value of equity at the beginning of the year. AE is the difference between income before extraordinary items for 
the current year and that of last year deflated by market value of equity at the beginning of the year. Stock 
Rankings and Debt Ratings represent numerical values of S&P common stock rankings and S&P senior debt 
ratings, respectively. Stock Rankings (Debt Ratings) are assigned a value of 1 if a firm's S&P common stock 
(senior debt) is rated as A+ (AAA), and as the S&P common stock rankings (debt ratings) decline, the 
numerical value increases by 1. FEPS is the mean annual one-year-ahead earnings per share forecasts for year t 
issued immediately following the earnings announcement for year t—1. Tenure is the duration of the auditor- 
client relationship in years starting from 1982. FirmAge measures the number of years that the firm has been 
publicly traded as of the fiscal year-end. 

The p-values for correlation coefficients are reported in parentheses. 


regression without the control variables using the full sample. More important, B, + B, is 
positive and significant (0.012, t-statistic = 3.45). Regression estimates indicate that inves- 
tors on average pay a premium of 2.4 percent (— 0.012/0.507) for earnings as the auditor- 
client relationship increases by an additional year. The coefficient on Tenure (B,) is also 
significant (0.005, t-statistic = 4.06), indicating that auditor tenure is positively associated 
with returns. 

The ERC is 0.432 (t-statistic — 9.00) in the second regression when we include a 
number of control variables. B, + B, remains positive and significant (0.008, t-statistic 
— 2.48). The parameter estimates suggest that, after controlling for the other determinants 
of ERC, investors pay a premium of 2 percent (= 0.008/0.432) for earnings as the audit 
engagement increases by an additional year. Even though the magnitude of B, + B, is 
smaller in the second regression, we find it reassuring that the results continue to be sig- 
nificant even after including many control variables. When we estimate the augmented 
model using the restricted sample for which the auditor-client relationship lasts for at least 
five years, we find similar results. Specifically, B, + B, is positive and significant at the 5 
percent level (0.008, t-statistic — 2.03) in the third regression. 

The parameter estimates of the components of ERC provide added insights into investor 
perceptions of the time-series properties of earnings. The coefficients on E and AE in all 
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. three regressions are positive and significant at the 1 percent level. Considering that earnings 
levels (earnings changes) are more important for pricing decisions when earnings contain 
transitory (permanent) components (Easton and Harris 1991; Ali and Zarowin 1992), the 
results suggest that investors view reported earnings as containing both transitory and per- 
manent components. Individual coefficients on E*Tenure (Q4) and AE*Tenure (B,) also 


TABLE 3 
Earnings Response Coefficients and Perceptions of Investors 
Full Sample 
Variables (Coefficients) (D (2) Restricted Sample 
Intercept (a) —0.037 (—5.82)*** 0.096 (—6.40)*** —0.088 (—5.87)*** 
E (B) 0.356 (6.91)*** 0.261 (4.51)*** 0.195 (2.92)*** 
AE (B4) 0.151 (8.09)*** 0.171 (5.04)*** 0.232 (5.60)*** 
(B, + Bj) 0.507 (10.30)*** — 0.432 (9.00)*** 0.427 (7.09)*** 
E*Tenure (B) 0.007 (2.49)** 0.006 (1.80)* 0.009 (2.18)** 
AE*Tenure (B4) 0.005 (3.39)*** 0.002 (1.34) —0.001 (—0.78) 
(B, + B) 0.012 (3.45)*** 0.008 (2.48)** 0.008 (2.03)** 
Tenure (Bs) 0.005 (4.06)*** 0.003 (5.68)*** 0.003 (5.29)*** 
Control Variables 
E*FirmAge (B,)/AE*FirmAge (B) 
(B, + B> —0.002 (— 1.75) —0.002 (—1.71) 
E*Big4 (B,)/ AE*Bie4 (Bo) 
s + Bs —0.091 (—2.25)**  —0.117 (-2.1D** 
E*Growth (B, )/ AE*Growth (B,.) 
(Bio + Bii) 0.023 (1.35) 0.035 (1.92)* 
E*Persistence (B,,)/ AE*Persistence (B,3) 
(Bi; + Bia) 0.084 (2.42)** 0.079 (2.19)** 
E*Volatility (B,,)/ AE*Volatility (Bs) 
Bi, + B —0.083 (—7.40)*** —0.092 (—7.52)*** 
E*Beta (B,9)/ AE*Beta (B,;) 
(Bis + Bi) —0.061 (—1.87)* | —0.070 (—1.99)* 
E*Size (B, a) / AE*Size (B,,) l 
(Bie + Bio) 0.119 (9.27)*** 0.132 (8.79)*** 


E*Leverage (B.,)/ AE*Leverage (Bz) 
(Boo + Bai) 


E*Regulation (B,,)/ AE*Regulation (Baa) 
(Boo + B>) 


—0.290 (—4.74)*** 


—0.296 (—4.21)*** 


—0.061 (—1.38) —0.051 (—1.27) 
FirmAge (Bo4) 0.001 (4.58)*** 0.001 (4.47)*** 
Big4 (B25) 0.047 (4.00)*** 0.041 (3.43)*** 
Growth (Bag) 0.023 (5.41)*** 0.024 (5.60)*** 
Persistence (B>) 0.014 (1.93)* 0.015 (2.28)** 
Volatility (Bas) —0.025 (—7.75)*** —0.025 (—8.53)*** 
Beta (B>) —0.005 (—0.18) —0.003 (—0.10) 
Size (Bao) —0.005 (—1.23) —0.005 (—1.39) 
Leverage (Bs) —0.018 (—0.71) —0.014 (—0.53) 
Regulation (B32) 0.045 (4.01)*** 0.044 (3.72)*** 
Yearly Observations 3,044—4,122 3,044—4,122 2,897—3,803 
Adjusted R? 0.039—0.162. 0.111—0.372 0.109—0.369 


(continued on next page) 
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*, ** and *** denote statistical significance at the 0.10, 0.05, and 0.01 level, respectively, for a two-tailed test. 
The full sample includes Compustat firms with available data from 1990 to 2000. The restricted sample consists 
of firms in the full sample with auditor-client relationships lasting for at least five years. The dependent variable 
is cumulative market-adjusted returns for the 12-month period ending three months after the fiscal year-end 
(CAR). Market-adjusted returns are the difference between raw returns and value-weighted CRSP market returns. 
E is income before extraordinary items deflated by market value of equity at the beginning of the year. AE is the 
difference between income before extraordinary items for the current year and that of last year deflated by 
market value of equity at the beginning of the year. Tenure is the duration of the auditor-client relationship in 
years starting from 1982. We suppress the individual coefficients on E*control variable and AE*control variable; 
instead, we report the sum of the two coefficients. The control variables are defined as follows: FirmAge 
measures the number of years that the firm is publicly traded as of the fiscal year-end; Big4 is an indicator 
variable that equals 1 when the client's auditor is a large accounting firm; Growth is the sum of the market 
value of equity and the book value of debt scaled by the book value of total assets; Persistence (Volatility) is the 
first-order autocorrelation (standard deviation) of income before extraordinary items per share for the past 16 
quarters; Beta is the systematic risk computed using the past 60 monthly stcck returns; Size is the logarithmic 
transformation of the fiscal year-end market value of equity of the prior year; Leverage is the ratio of total debt 
to total assets; and Regulation is an indicator variable that equals 1 for firms in a regulated industry with two- 
digit SIC codes between 40 and 49 or between 60 and 63, otherwise it equals 0. 

The reported coefficients are the average of yearly coefficients from 1990 to 2000, and the corresponding t- 
statistics in parentheses are based on the distribution of the yearly ccefficients. 


provide some insights: (1) B, is larger than B, in all three regressions; and (2) B, is statis- 
tically significant in all three regressions at the 10 percent level or below, while B, is 
significant in only one regression. The incremental importance of earnings levels in pricing 
decisions suggests that, as the auditor-client relationship lengthens, investors perceive re- 
ported earnings as being more transitory. 

We briefly discuss the results of the control variables because they have been exten- 
sively analyzed in prior studies (Subramanyam and Wild 1996; Warfield et al. 1995; 
Dhaliwal and Reynolds 1994; Collins and Kothari 1989). To conserve space, we do not 
report the individual coefficients on the interactions between each control variable and E/ 
AE. Instead, we report the sum of the two interaction coefficients. Controlling for other 
determinants of ERC, our results based on the restricted sample suggest that the ERC does 
not vary with FirmAge (Bs + B, is insignificant). Since Bg + B, is negative and significant, 
our results indicate that the ERC is lower for client firms of large auditing firms (Big4), 
which is inconsistent with the findings of Teoh and Wong (1993). As in prior studies, we 
find that the ERC is positively associated with Growth, Persistence, and Size, and is neg- 
atively associated with Volatility, Beta, Leverage, and Regulation. Also, the coefficients on 
FirmAge, Big4, Growth, Persistence, Volatility, and Regulation are all significant at the 5 
percent level or below. 


Bias in the Estimated ERC from Prices Leading Earnings 

A potential concern in Table 3 is that the estimated ERC from a contemporaneous 
returns-earnings regression might be biased downward if the market anticipates current year 
earnings more than one year prior to the earnings release (Kothari 1992; Kothari and Sloan 
1992). The estimated ERC could be biased from using a 12-month contemporaneous period 
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to compute returns or from using market value of equity as a deflator.'* Further, the down- 
ward bias in the estimated ERC is expected to be larger for firms with longer tenure. Given 
that prior studies generally find that earnings quality improves with extended tenure, and 
considering that high-quality earnings are easier to predict, the market is more likely to 
anticipate current-year earnings for firms with longer tenure. 

We mitigate potential biases in the estimated ERC in two ways. First, as in DeFond 
and Hung (2003) and Collins and Kothari (1989), we measure returns over a 15-month 
period ending three months after the fiscal year-end. Increasing the returns measurement 
window enhances the probability of including the period in which the market anticipates 
current earnings thereby reducing any measurement error. Second, as in Subramanyam 
(1996), we deflate earnings by total assets at the beginning of the year. Since total assets 
are typically based on historical cost rather than market value, any bias in the estimated 
coefficients from prices leading earnings is likely to be reduced from using total assets as 
a deflator. 

Panel A of Table 4 reports the results from using cumulative market-adjusted returns 
measured over 15 months (CAR, |) as the dependent variable. While the magnitude of 
ERC (B, + B) is smaller, we find that the size of the sum of the coefficients on E*Tenure 
and AE*Tenure (B, + B,) is 25 to 38 percent larger than that reported in Table 3. One 
reason why the ERC is smaller may be because of the large number of control variables. 
When we estimate the regressions in Panel A without the control variables, we find that 
the estimated ERC becomes larger. Specifically, unreported results show that the ERC for 
the full sample without the control variables is 0.568, while the corresponding estimate is 
0.507 in Table 3. 

We find similar results using total assets as a deflator for E and AE. The estimate of 
B, + B4 using the full (restricted) sample is 0.027 (0.025) in Panel B of Table 4, while the 
corresponding estimate in Table 3 is 0.008 (0.008). Thus, the magnitude of B, + B, is 213 
to 238 percent larger in Panel B compared to that reported in Table 3. The magnitude of 
ERC is also larger in Panel B than that in Table 3. Hence, the results in Table 4 suggest 
that investors are more likely to anticipate current-year earnings more than one year ahead 
of the earnings release for firms with extended auditor-client relationships. 

Overall, the results from Tables 3 and 4 are consistent with the hypothesis that investors 
perceive earnings quality as improving with longer auditor tenure. Our results suggest that 
investors view audit quality as improving with auditor tenure. 


Perceptions of Information Intermediaries and Auditor Tenure 
S&P Stock Rankings and Debt Ratings 

Unlike stock returns, stock rankings and debt ratings are possibly correlated over time. 
Therefore, the residuals from yearly regressions may not be independent, which could lead 
to biased t-statistics. We compute an autocorrelation-corrected t-statistic (t-statistic^C) by 
comparing the mean coefficients with the time-series-based standard error that accounts for 


12 1f prices lead earnings, current period earnings contain information that is "new" to the market and information 
that is “stale” or “old.” Given that stock prices change in response to new information, the independent variable 
contains errors of measurement from using current period earnings as the regressor (also known as errors-in- 

: variable problem). Measurement error on the independent variable biases the slope coefficient downward because 
of its correlation with the regression disturbance term (Greene 1993). Similarly, deflating earnings by market 
value of equity might inject a downward bias in the estimated coefficients when the market anticipates current 
period earnings at the beginning of the year. 


The Accounting Review, April 2005 


TABLE 4 
Earnings Response Coefficients and Perceptions of Investors: Sensitivity Analysis 
Variables (Coefficients) Full Sample Restricted Sample 
Panel A: CAR,, is the Dependent Variable 
E (B) 0.217 (2.62)** 0.165 (1.95)* 
AE (B) 0.169 (3.80)*** 0.248 (4.07)*** 
(B, + By) 0.386 (5.50)*** 0.413 (5.21)*** 
E*Tenure (Ba) 0.008 (1.65) 0.011 (1.92)* 
AE*Tenure a) 0.003 (1.30) —0.001 (—0.91) 
(B4 + BÀ) 0.011 (2.84)*** 0.010 (2.24)** 
Tenure (Bs) 0.004 (5.99)*** 0.003 (5.37)*** 
Control Variables (as in Table 3) Included Included 
Yearly Observations 3,044 4,122 2,897—3,803 
Adjusted R? 0.145—0.322 0.154—0.327 


Panel B: Earnings are Scaled by Total Assets 


Ghosh and Moon 


E (B) 0.415 (5.06)*** 0.393 (5.09)*** 
AE (B-) 0.379 (3.99)*** 0.468 (3.69)*** 
(B, + B5) 0.794 (8.33)*** 0.861 (8.57)*** 
E*Tenure (B4) 0.006 (0.84) 0.012 (1.16) 
AE*Tenure (B4) 0.021 (2.46)** 0.013 (1.58) 
(B, + B) 0.027 (2.41)** 0.025 (2.38)** 
Tenure (B) 0.002 (2.68)** 0.001 (0.94) 
Control Variables (as in Table 3) Included Included 
Yearly Observations 3,218—4,207 3,063-3,888 
Adjusted R? 0.095—0.278 0.091—0.288 


* ** and *** denote statistical significance at the 0.10, 0.05, and 0.01 level, respectively, for a two-tailed test. 
The full sample includes Compustat firms with available data from 1990 to 2000. The restricted sample consists 
of firms in the full sample with auditor-client relationships lasting for at least five years. In Panel A, the 
dependent variable is cumulative market-adjusted returns for the 15-month period ending three months after the 
fiscal year-end (CAR smon). Market-adjusted returns are the difference between raw returns and value-weighted 
CRSP market returns. E is income before extraordinary items deflated by market value of equity at the 
beginning of the year. AE is the difference between income before extraordinary items for the current year and 
that of last year deflated by market value of equity at the beginning of the year. In Panel B, the dependent 
variable is CAR measured over the 12-month period ending three months after the fiscal year-end. E is income 
before extraordinary items deflated by total assets at the beginning of the year. AE is the difference between 
income before extraordinary items for the current year and that of last year deflated by total assets at the 
beginning of the year. The results of all the control variables in Panels A and B are not reported for brevity. 
The reported coefficients are the average of yearly coefficients from 1990 to 2000, and the corresponding 


t-statistics in parentheses are based on the distribution of the yearly coefficients. 


any dependence in the estimated coefficients, as in Abarbanell and Bernard (2000).!? Since 


'3 We assume that serial correlation is second-order autoregressive, so that the correction involves multiplying the 
(1 + 122) - 2*r*(1— r” 


Q Di Lm re 
to compute the mean coefficient and r is the average of the first- and second-order autocorrelations estimated 
from the series of annual coefficients. The correction factor is not applied when estimated autocorrelation is less 
than zero. 


yearly standard errors by the factor | where n is the number of observations used 
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our focus is on auditor tenure, we compute t-statistic^ for coefficients that involve Tenure 


only. 


Table 5 reports the regression results of common stock rankings on auditor tenure. The 
first column in Table 5 presents the results from the reduced model using the full sample. 


TABLE 5 


Stock Rankings and Perceptions of Information Intermediaries 


Variables (Coefficients) 
Intercept (a) 
E (Bi) 
AE (B> 

(B, + B> 
E*Tenure (B3) 
AE*Tenure (Ba) 

(B, + Ba) 
Tenure (Bs) 
Control Variables 


5.746 (100.31)*** 
—1969 (—5.25)*** 


1.314 (4.77)*** 
—0:6551—525)*** 


—0.100 (—3.80)*** 


0.046 (2.39)** 
—0.054 (—3.76)*** 


—0.068 (—6.05)*** 


E*FirmAge (Bj)/ AE*FirmAge (B) 
(Bs + By) 


E*Big4 (B,)/AE*Big4 (B) 
(Ba + 


8 


E*Growth (B,,)/AE*Growth (Bi) 
(Bio + B1) 


E*Persistence (B,,)/ AE*Persistence (B,3) 


(Bi; + Bis) 


E*Volatility (B,,)/AE*Volatility (B,s) 


Bis 


E*Beta (B19/ AE*Beta (B,7) 


(Bis + Bry) 


E*Size (B,,)/AE*Size (Bio) 


(Bis + Bis) 


Bis 


E*Leverage (B,9)/ AE*Leverage (B21) 
( 


20 


1 


E*Regulation (B..)/AE*Regulation (Ba) 
Bos) 


22 
FirmAge (B24) 
Big4 (Bos) 
Growth (Bos) 
Persistence (B27) 
Volatility (Bag) 
Beta (B29) 
Size (B30) 
Leverage (Bs) 
Regulation (B32) 
Yearly Observations 
Adjusted R? 


2,019—2,637 
0.125-0.169 


Full Sample 


—— ns  ——]O —ə—F A 


6.610 (140.79)*** 
—0.208 (— 0.91) 


—0.148 (—0.58) 
-D.356(—2.03)* 


—0.061 (—5.16)*** 


0.021 (1.93)* 
—0.040 (—3.31)*** 


=0.023 (9.33) *** 
0.017 (2.45)** 
0.264 (2.54)** 

—0.134 (—1.66) 
0.536 (3.40)*** 
0.189 (2.60)** 
0.265 (4.21)*** 

—0.208 (—4.40)*** 
0.489 (2.62)** 


0.038 (0.28) 
0.001 (0.88) 
0.250 (11.51)*** 
0.033 (2.66)** 
—0.032 (—0.79) 
0.309 (15.93)*** 
0.372 (8.71)*** 


—0.375 (—26.10)*** 


0.407 (5.77)*** 
—0.207 (—6.00)*** 

2,019—2,637 

0.463—0.523 


Restricted Sample. 


6.614 (149.22)*** 
—0.366 (—1.01) 
—0.030 (—0.08) 
—0.396 (—1.84)* 
—0.063 (—5.87)*** 

0.023 (1.96)* 
—0.040 (—2.97)*** 
—0.024 (—9.79)*** 


0.021 (2.72)** 
0.262 (2.61)** 
—0.158 (—1.77) 
0.625 (4.23)*** 
0.193 (2.64)** 
0.320 (3.79)*** 
—0.226 (—4.09)*** 
0.438 (1.83)* 


0.040 (0.24) 
0.000 (0.46) 
0.256 (14.51)*** 
0.030 (3.19)*** 
—0.023 (—0.53) 
0.329 (15.88)*** 
0.373 (9.44)*** 
—0.371 (—24.15)*** 
0.396 (5.63)*** 
—0.205 (—6.10)*** 
1,949-2,487 
0.448-0.519 


(continued on next page) 
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+, ** and *** denote statistical significance at the 0.10, 0.05, and 0.01 level, respectively, for a two-tailed test. 
The full sample includes Compustat firms with available data from 1990 to 2000. The restricted sample consists 
of firms in the full sample with auditor-client relationships lasting for at least five years. The dependent variable 
is S&P common stock rankings converted into numerical values (Stock Rankings). Stock Rankings take the 
values of 1 to 7 representing S&P common stock rankings of A+, A, A—, B+, B, B—, and C, respectively. 
Tenure is the duration of the auditor-client relationship in years starting from 1982. E is income before 
extraordinary items deflated by market value of equity at the beginning of the year. AE is defined as the 
difference between income before extraordinary items for the current year and that of last year deflated by 
market value of equity at the beginning of the year. We suppress the individual coefficients on E*control 
variable and AE*control variable; instead, we report the sum of the two coefticients. The control variables are 
defined as follows: FirmAge measures the number of years that the firm is publicly traded as of the fiscal year- 
end; Big4 is an indicator variable that equals 1 when the client's auditor is a large accounting firm; Growth is 
the sum of the market value of equity and the book value of debt scaled by the book value of total assets; 
Persistence (Volatility) is the first-order autocorrelation (standard deviation) of income before extraordinary items 
per share for the past 16 quarters; Beta is the systematic risk computed using the past 60 monthly stock returns; 
Size is the logarithmic transformation of the fiscal year-end market value of equity of the prior year; Leverage is 
the ratio of total debt to total assets; and Regulation is an indicator variable that equals 1 for firms in a regulated 
industry with two-digit SIC codes between 40 and 49 or between 60 and 63, and 0 otherwise. 

The reported coefficients are the average of yearly coefficients from 1990 to 2000, and the corresponding 
t-statistics in parentheses are based on the distribution of the yearly coefficients. 





As in Table 3, ERC, a proxy for earnings quality as perceived by ratings agencies, is defined 
as the sum of the coefficients on E and AE.!^ The ERC is negative (—0.655) and statistically 
significant at the 1 percent level (t-statistic = —5.25). B, + B,, which measures the joint 
effects of tenure and earnings on rankings, is also negative (—0.054) and significant at the 
1 percent level (t-statistic = —3.76; t-statistic^" = —3.14). All else equal, for a given 
increase in earnings, Stock Rankings improve by an additional 8.2 percent (= —0.054/ 
—0.655) when the audit engagement increases by an added year. 

The coefficient on Tenure (B,) is negative (—0.068) and significant at the 1 percent 
level (t-statistic = —6.05; t-statistic^C = — 3.52). Other things remaining constant, if auditor 
tenure for a firm is one standard deviation above the mean value of Tenure, the stock ranking 
is lower by 0.327 (= —0.068 x 4.807), compared to the average stock ranking of 5.096 
for the sample used in the regression. 

The second column in Table 5 reports the results from the augmented model using the 
full sample. B, + B, remains negative (—0.040) and significant (t-statistic = —3.31; 
t-statistic^^ = —3.31). B, is also negative (—0.023) and significant (t-statistic = —9.33; 
t-statistic^" = -—6.59). The results are very similar when we use the restricted sample 
(reported in the last column). B, + B, is negative (—0.040) and significant (t-statistic = 
—2.97; t-statistic^" = —2.97), and B, is highly significant (—0.024, t-statistic = —9.79; 
t-statistic^" = —6.78). Our results suggest that rating agencies place a greater weight on 
reported earnings as auditor tenure increases. Given that S&P incorporates the quality of 
accounting information as a factor in establishing its rankings/ratings (Standard & Poor's 
1982), the importance of tenure suggests that, as tenure increases, independent rating agen- 
cies perceive audited financial statements as being more reliable. 

We briefly discuss how various control variables affect Stock Rankings (based on the 
full sample). The coefficients on the interactions between earnings and FirmAge, Persist- 
ence, Volatility, Beta, and Leverage are positive and significant at the 5 percent level or 


i+ Although the term ERC or earnings response coefficient is typically used for earnings coefficients in returns- 
earnings regressions, we use the term in a broader context to refer to earnings coefficients for the returns, stock 
rankings, debt ratings, and analyst forecasts regressions. 
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below. These results suggest that the effect of earnings on Stock Rankings is smaller for 
mature/risky firms. The positive sign of the coefficient on the interactions between earnings 
and Big4 (B4 + Bs) is puzzling. The coefficient on the interactions between earnings and 
Size (B, s + Bio) is negative and significant at the 1 percent level, indicating that the effect 
of earnings on Stock Rankings is larger for bigger firms. The coefficients on the interactions 
between earnings and Growth (Bio + B,,) and those between earnings and Regulation (Ba 
t Ba) are both insignificant. 

The control variables are also included as separate independent variables. The positive 
sign of the coefficient on Big4 is again puzzling. The coefficients on Volatility, Beta, and 
Leverage are all positive and significant, which suggests that higher risk has an unfavorable 
impact on stock rankings. The coefficient on Growth is also positive, which is inconsistent 
with our expectations. If our model does not adequately control for risk, growth firms are 
more likely to be associated with unfavorable Stock Rankings because higher growth could 
proxy for larger risk. The coefficients on Size and Regulation are negative, which suggests 
that stock rankings are more favorable for larger firms and those in regulated industry. The 
coefficients on FirmAge and Persistence are insignificant. 

Table 6 presents the regression results of the association between debt ratings and 
auditor tenure. The first column in Table 6 presents the results of estimating the reduced 
model using the full sample. Consistent with prior research, the ERC is negative and sig- 
nificant (—4.568, t-statistic = —5.87). Thus, earnings have a favorable influence on debt 
ratings. The sum of the coefficients on E*Tenure and AE*Tenure (B, + B4) 1s also negative, 
but insignificant (—0.093, t-statistic = — 1.03; t-statistic^* = —0.98). In contrast, the coef- 
ficient on Tenure (B;) is negative and significant (—0.183, t-statistic = —6.33; t-statistic^^ 
= —3.42), consistent with the findings of Mansi et al. (2004). 

Once we include the control variables in Table 6, the ERC is insignificant in either the 
second or the third regression. Although B, + B, is marginally significant (—0.216, t-statistic 
= —1.98; t-statistic^" = — 1.85) for the full sample, it is insignificant (—0.240, t-statistic 
= —].66; t-statistic’ = — 1.62) for the restricted sample. On the other hand, B, remains 
negative and significant for the full sample (—0.065, t-statistic = —4.78; t-statistic*© 
= —2.70) and the restricted sample (—0.067, t-statistic = —4.51; t-statisticA© = —2.70). 
Thus, although longer tenure is associated with improved debt ratings, the association be- 
tween earnings and debt ratings does not appear to vary with tenure. 

In a recent study, Mansi et al. (2004) find that auditor tenure is negatively related to 
the cost of debt and that the relation is more pronounced for non-investment-grade debt. 
This finding suggests that auditor tenure could influence debt ratings differently for invest- 
ment and non-investment-grade debt. Following Mansi et al. (2004), we separately estimate 
the augmented model for investment and non-investment-grade debt to investigate whether 
the impact of auditor tenure on debt ratings depends on the quality of debt. When we 
partition our sample into investment and non-investment-grade debt subsamples, the results 
are very similar to those reported in Table 6. For example, B, + B, is negative but not 
significant in either subsample, but B, is negative and significant for both subsamples (for 
brevity, the results are not tabulated). 

In sum, the regression results from stock rankings and debt ratings provide modest 
evidence that independent rating agencies perceive firms with longer auditor tenure as 
providing more reliable financial information. 


15 Since the same control variables appear in the stock rankings and debt ratings regressions, and the results are 
similar across the two sets of regressions, we do not repeat our discussion of the control variables for the debt 
ratings regressions. 
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Financial Analysts? Earnings Forecasts 

Table 7 reports the results of how auditor tenure affects the association between one- 
year-ahead consensus earnings per share forecasts and earnings per share reported just prior 
to the forecasts. As in the previous tables, our interest is in the sum of the coefficients on 
EPS, ,* Tenure and AEPS, ,*Tenure (B, + B,). The first column in Table 7 reports the 


FirmAge (B24) 

Big4 (B25) 

Growth (Bog) 

Persistence (Bz) 

Volatility (Bos) 

Beta (Bas) 

Size (Bso) 

Leverage (B31) 

Regulation (B32) 
Yearly Observations 603—1,124 
Adjusted R? 0.132—0.245 


The Accounting Review, April 2005 


—0.014 (—7.46)*** 
1.473 (3.13)*** 

—0.102 (—1.94)* 
0.306 (4.78)*** 
0.254 (7.55)*** 
1.359 (12.12)*** 


TABLE 6 
Debt Ratings and Perceptions of Information Intermediaries 
Full Sample 
Variables (Coefficients) (1) | (2) Restricted Sample 
Intercept (a) 11.101 (64.35)*** 14.387 (42.74)*** 14.032 (31.54)*** 
E (B) —8.796 (—3.37)*** —1.378 (—0.21) 2.596 (0.40) 
AE (B) 4.228 (2.13)** 14.567 (1.13) 11.871 (0.88) 
(B, B5)  —4.568 (—5.87)*** 13.189 (1.02) 14.467 (1.13) 
E*Tenure (B4) —0.169 (—0.84) 0.014 (0.18) 0.020 (0.22) 
AE*Tenure (B4) 0.076 (0.54) —0.230 (—2.06)** —0.260 (—1.89)* 
(Ba + B) —0.093 (—1.03) —0.216 (—1.98)* —0.240 (—1.66) 
Tenure (Bs) —0.183 (—6.33)*** —0.065 (—4.78)*** | —0.067 (—4.51)*** 
Control Variables 
E*FirmAge (B4)/ AE*FirmAge (B,) 
(Bs + B> 0.040 (3.11)*** 0.040 (2.63)** 
E*Big4 (B,)/AE*Big4 (Bo) 
(Bs + Bo) —13.630 (— 1.05) —15.302 (—1.17) 
E*Growth (Bio) / AE*Growth (B) 
wt Bu 1.333 (1.61) 1.641 (1.54) 
E*Persistence (B,,)/ AE*Persistence (8,3) 
a Bis 1.284 (3.17)*** 1.280 (2.97)*** 
E*Volatility (B,,)/AE*Volatility (B,s) 
(Bi, + Bis 0.068 (0.57) 0.114 (0.67) 
E*Beta (B, |¿)/ AE*Beta (B,,) 
(Bis + Biz) 1.011 (2.11)** 1.083 (2.26)** 
E*Size (B))/ AE*Size (Bis) 
Bis + Bio) —0.594 (—2.67)** 250:552 (=2.2))** 
E*Leverage (B49)/ AE*Leverage (B21) 
(Bop + Bay 0.449 (0.29) 0.512 (0.32) 
E*Regulation (B,.)/AE*Regulation (B.) 
(B4, + Ba) —1.127 (—1.84)* —1.271 (—1.99)* 


—0.014 (—7.13)*** 
1.914 (3.80)*** 

—0.108 (—1.98)* 
0.305 (4.60)*** 
0.223 (5.36)*** 
1.333 (11.66)*** 


—1.087 (—31.65)*** —1.088 (—29.81)*** 


3.676 (13.00)*** 


3.663 (12.30)*** 


—1.025 (—10.18)*** —1.017 (—10.20)*** 


603—1,124 
0.602—0.699 


598—1,051 
0.596—0.697 
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* **. and *** denote statistical significance at the 0.10, 0.05, and 0.01 level, respectively, for a two-tailed test. 
The full sample includes Compustat firms with available data from 1990 to 2000. The restricted sample consists 
of firms in the full sample with auditor-client relationships lasting for at least five years. The dependent variable 
is S&P senior debt ratings converted into numerical values (Debt Ratings). Debt Ratings are assigned a value of 
| if a firm's S&P senior debt is rated as AAA. As the S&P debt ratings decline, the numerical value increases 
by 1. Tenure is the duration of the auditor-client relationship in years starting from 1982. E is income before 
extraordinary items deflated by market value of equity at the beginning of the year. AE is defined as the 
difference between income before extraordinary items for the current year and that of last year deflated by 
market value of equity at the beginning of the year. We suppress the individual coefficients on E*control 
variable and AE*control variable; instead, we report the sum of the two coefficients. The control variables are 
defined as follows: FirmAge measures the number of years that the firm is publicly traded as of the fiscal year- 
end; Big4 is an indicator variable that equals 1 when the client's auditor is a large accounting firm; Growth is 
the sum of the market value of equity and the book value of debt scaled by the book value of total assets; 
Persistence (Volatility) is the first-order autocorrelation (standard deviation) of income before extraordinary items 
per share for the past 16 quarters; Beta is the systematic risk computed using the past 60 monthly stock returns; 
Size is the logarithmic transformation of the fiscal year-end market value of equity of the prior year; Leverage is 
the ratio of total debt to total assets; and Regulation is an indicator variable that equals 1 for firms in a regulated 
industry with two-digit SIC codes between 40 and 49 or between 60 and 63, and 0 otherwise. 

The reported coefficients are the average of yearly coefficients from 1990 to 2000, and the corresponding 
t-statistics in parentheses are based on the distribution of the yearly coefficients. 


results of the reduced model for the full sample. We find that reported earnings are posi- 
tively associated with analysts’ earnings forecasts. The ERC (B, + B.) is positive and highly 
significant (1.008, t-statistic = 20.29). Moreover, B, + B, is also positive and significant 
(0.019, t-statistic = 2.68). Other things remaining constant, our results suggest that analysts 
appear to attach greater importance to the most recent reported earnings in forming their 
expectations about future earnings as auditor tenure grows longer. 

When we estimate the augmented model that includes a number of control variables, 
B, + B, remains positive and significant (0.016, t-statistic = 2.90) in the second regression. 
The results are very similar when we estimate the augmented model using the restricted 
sample. B4 + B, continues to be positive and significant in the third regression (0.016, 
t-statistic = 2.51). On the other hand, B, is insignificant in all three regressions. 

The individual parameter estimates of ERC and B, + B, provide some added insights. 
The coefficients on EPS, , (B,) and AEPS, , (B2) are both significant in the first regression. 
However, in the second and third regressions that include the other determinants of earnings 
forecasts, B, becomes insignificant. Large earnings surprises are more likely to be temporary 
and, therefore, potentially less useful to analysts in predicting future earnings (Barron et 
al. 2002; Stickel 1989). In contrast, the coefficient on AEPS,. , *Tenure (B,) is significant in 
all three regressions at the 10 percent level or below. The results suggest that although past 
earnings changes do not influence earnings forecasts, they are perceived as more relevant 
for earnings forecasts as the auditor-client relationship increases. 

Based on the results from the restricted sample, we briefly discuss how the various 
control variables affect analysts’ earnings forecasts (FEPS,). The coefficients on the inter- 
actions between reported earnings and FirmAge, Leverage, and Analysts are all negative 
and significant at the 10 percent level or below. These results suggest that the effect of 
reported earnings on FEPS, is smaller for mature/risky firms and for firms with larger 
analyst following. The interaction between reported earnings and Size is positive and sig- 
nificant, which suggests that the impact of reported earnings on analysts' earnings forecasts 
is greater for bigger firms. Also, the individual coefficients on FirmAge, Volatility, Leverage, 
and Analysts are all positive and significant at the 5 percent level or below, which suggests 
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that analysts' earnings forecasts are higher for mature firms, for firms with higher risk, and 
those with larger analyst following. The coefficient on Size is negative, which suggests that 
earnings per share forecasts are smaller for bigger firms. One explanation is that large, 
politically sensitive firms are more likely to report lower earnings to reduce their political 
costs and, therefore, analysts' earnings forecasts are also lower for larger firms. 


TABLE 7 


Analyst Earnings Forecasts and Perceptions of Information Intermediaries 


Variables 


EPS, ,*Tenure 
AEPS, ,*Tenure 


Tenure 
Control Variables 


(Coefficients) 


—  . ENE  P'2——sskL TH —— M 


Full Sample 


0.217 (8.84)*** 
0.834 (22.34)*** 


0.174 (4.59)*** 
1.008 (20.29)*** 


0.005 (1.75) 


0.014 (2.00)** 
0.019 (2.68)** 


—0.000 (—0.15) 


EPS, ,*FirmAge (B)! AEPS, ,*FirmAge (B.) 
B7) 


6 


EPS,_,*Big4 (B)/ AEPS, ,*Big4 (Bo) 


s + By 


EPS,_,*Growth (B,,)/ AEPS, ,*Growth (8,1) 


(Bio + Bir) 


EPS, ,*Volatility (B,.)/AEPS,_,*Volatility (B,3) 


(Bi; + Bis 


EPS, ,*Beta (B.V AEPS 1?! Beta (B,5) 


14 T 15 


EPS. ,*Size (B,.)/ AEPS, ,*Size (By) 


16 + i7 


EPS, ,*Leverage (Bia) AEPS, *Leverage (Big) 


18 T 19 


EPS, ., *Regulation (B5;)/ AEPS,_,*Regulation (Bz) 


(Boo + P21) 


EPS, ,*Analysts (B.,)/AEPS,_, *Anatysts (Ba) 
(B> + Bas) 


FirmAge 
Big4 
Growth 
Volatility 
Beta 
Size 
Leverage 
Regulation 
Analysts 
Yearly Observations 
Adjusted R? 


The Accounting Review, April 2005 


(B24) 
(B25) 
(B26) 
(B27) 
(Bos) 
(Bao) 
(B30) 
(B31) 
(B32) 


1,100-1,855 
0.736-0.878 


0.366 (15.47)*** 
0.532 (15.28)*** 
—0.068 (—0.66) 
0.464 (4.52)*** 
0.005 (1.79) 


0.011 Q.11)** 
0.016 (2.90)*** 


—0.003 (—1.16) 


—0.004 (—2.83)*** 
—0.060 (—1.29) 
—0.008 (—0.33) 
—0.032 (—0.67) 
—0.011 (—0.37) 
0.140 (7.17)*** 
-0.235 (-2.18)** 
—0.041 (—1.26) 


—0.013 (—3.85)*** 
0.003 (4.92)*** 
0.028 (1.39) 

—0.003 (—0.91) 
0.138 (3.04)*** 

—0.004 (—0.28) 

—0.048 (—12.29)*** 
0.174 (5.06)*** 
0.020 (0.89) 
0.002 (1.82)* 
1,100—1,855 
0.759—0.891 


Restricted Sample. 


0.364 (14.87)*** 
0.530 (15.64)*** 


—0.074 (—0.70) 
0.456 (4.49)*** 


0.005 (1.54) 


0.011 (1.95)* 
0.016 (2.51)** 


—0.002 (—0.76) 


—0.004 (—2.90)*** 
—0.077 (—1.49) 
—0.010 (—0.41) 
—0.033 (—0.67) 
—0.012 (—0.38) 
0.147 (8.00)*** 
50,225 (--1.99)* 
—0.034 (—1.04) 


—0.014 (—4.20)*** 
0.004 (4.95)*** 
0.034 (1.54) 

—0.003 (—1.00) 
0.138 (2.86)*** 

—0.002 (—0.18) 

—0.050 (—12.99)*** 
0.168 (4.41)*** 
0.013 (0.55) 
0.002 (2.17)** 
1,074—1,735 
0.757—0.893 
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* ** and *** denote statistical significance at the 0.10, 0.05, and 0.01 level, respectively, for a two-tailed test. 
The full sample includes Compustat firms with available data from 1990 to 2000. The restricted sample consists 
of firms in the full sample with auditor-client relationships lasting for at least five years. The dependent variable 
is the mean annual one-year-ahead earnings per share forecasts for year t issued immediately following the 
earnings announcement for year t—1 (FEPS,). EPS, is annual earnings per share for year t-1, and AEPS, , is 
the absolute change in earnings per share for year t—1 defined as the difference in annual earnings per share in 
year t—1 and that in year t—2 (IEPS, , — EPS,_,|). All earnings per share variables (FEPS,, EPS, ,, and 

AEPS, ,) are obtained from I/B/E/S database. Tenure is the duration of the auditor-client relationship in years 
starting from 1982. All the control variables including Tenure are measured as of the end of year t—1. We 
suppress the individual coefficients on E*control variable and AE*control variable; instead, we report the sum of 
the two coefficients. The control variables are defined as follows: FirmAge measures the number of years that 
the firm is publicly traded as of the fiscal year-end; Big4 is an indicator variable that equals 1 when the client's 
auditor is a large accounting firm; Growth is the sum of the market value of equity and the book value of debt 
scaled by the book value of total assets; Volatility is the standard deviation of income before extraordinary items 
per share for the past 16 quarters; Beta is the systematic risk computed using the past 60 monthly stock returns; 
Size is the logarithmic transformation of the fiscal year-end market value of equity of the prior year; Leverage is 
the ratio of total debt to total assets; Regulation is an indicator variable that equals 1 for firms in a regulated 
industry with two-digit SIC codes between 40 and 49 or between 60 and 63, and otherwise 0; and Analysts is 
the number of analysts providing annual earnings forecasts. 

The reported coefficients are the average of yearly coefficients from 1990 to 2000, and the corresponding 
t-statistics in parentheses are based on the distribution of the yearly coefficients. 


A key advantage of using consensus forecasts is that the aggregation process offsets 
individual forecast errors, or idiosyncratic errors, which results in the consensus forecasts 
being more accurate than the individual forecasts (Brown et al. 1985). However, one con- 
cern is.that the idiosyncratic errors may not be fully diversified away when the consensus 
is based on a few individual forecasts (Barron et al. 1998). Although any measurement 
error in the dependent variable does not bias the estimated coefficients, the estimated var- 
iances are larger and therefore the t-statistics are also lower (Gujarati 1995). One solution 
to the potential measurement error problem is to impose a restriction on the number of 
individual forecasts required to compute the mean forecast. The diversification argument 
suggests that the measurement error declines with an increase in the number of individual 
forecasts. 

Consistent with the diversification argument, we find that the magnitude of the t- 
statistics becomes larger when we impose restrictions on the number of individual forecasts. 
For instance, B, + B, is 0.01 and the associated t-statistic is 2.90 in the second regression 
(Table 7). However, as in Barron et al. (2002), when we require that the consensus forecasts 
be based on a minimum of three individual forecasts, B, + B, is 0.01 but the t-statistic 
increases to 3.40. The t-statistic jumps to 4.67 when the consensus forecast is based on a 
minimum of seven individual forecasts. 

Thus, our results from analyst earnings forecasts are consistent with the hypothesis that 
financial analysts perceive earnings quality as improving with longer auditor tenure. 


Sensitivity Analysis 

We conduct a number of additional analyses to test the robustness of our results. Instead 
of tabulating the results of the additional tests, we discuss them briefly in this subsection. 
First, given that raw returns are frequently used to estimate contemporaneous returns- 
earnings models (Lundholm and Myers 2002; Warfield et al. 1995; Collins and Kothari 
1989), we estimate the returns-earnings regression model using 12-month compounded raw 
returns (ending three months after the fiscal year-end) as the dependent variable. The results 
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based on raw returns are very similar to those reported in Table 3 using cumulative market- 
adjusted returns. The sum of the coefficients on E*Tenure and AE*Tenure for the full and 
restricted samples is 0.010 (t-statistic = 2.50) and 0.009 (t-statistic = 2.15), respectively. 
The coefficient on Tenure is also positive and significant for the full sample (0.004, 
t-statistic = 5.52) and the restricted sample (0.003, t-statistic = 4.84). 

Second, considering that cash flow might be an important determinant of debt ratings, 
we estimate Debt Ratings regressions after including operating cash flow (deflated by mar- 
ket value of equity at the beginning of the year) in addition to all the other variables. The 
inclusion of the cash flow variable does not affect the tenor of the results reported in Table 
6. The coefficient on Tenure continues to be negative and significant for the full sample 
(—0.066, t-statistic = —4.91) and the restricted sample (—0.067, t-statistic = —4.54). The 
coefficient on cash flow is not significant. 

Third, we also analyze S&P subordinated debt ratings (Compustat #320). The sample 
size is relatively small: the number of observations used in the yearly regressions varies 
between 251 and 372 for the full sample and between 245 and 358 for the restricted sample. 
The results from S&P subordinated debt ratings are very similar to those using senior debt 
ratings. The coefficient on Tenure is negative and significant for the full sample (—0.047, 
t-statistic — —2.96) and for the restricted sample (—0.031, t-statistic — —1.89). The sum 
of the coefficients on E*Tenure and AE*Tenure is negative but insignificant for both 
samples. 

Fourth, we estimate the effects of tenure on rankings/ratings using an ordered probit 
model. Since Stock Rankings and Debt Ratings (the dependent variables) are discrete out- 
comes, conventional regression methods could yield biased estimates (Greene 1993). We 
find that the results in Tables 5 and 6 are not affected when we use an ordered probit 
model. For example, the coefficient on Tenure is negative and highly significant for the 
Stock Rankings regression (—0.036, x? = 732.74) and the Debt ratings regression (—0.039, 
X^ = 473.14) for the full sample. 

Fifth, the results reported in Table 7 are based on levels specification—earnings per 
share forecasts (FEPS) and the two reported éarnings per share measures (EPS and AEPS) 
are not deflated. One concern is that the ordinary least squares estimates might be inefficient 
and/or biased because of scale differences (Barth and Kallapur 1996; Dechow 1994). Our 
conclusions remain unchanged when we estimate the analyst forecast model after (1) de- 
flating all the earnings per share variables (FEPS, EPS, and AEPS) by the stock price per 
share at the beginning of the forecast period, and (2) deleting the top and bottom 1 percent 
of the observations for each variable. The sum of the coefficients on EPS, ,*Tenure and 
AEPS, ,*Tenure remains positive and significant for the full sample (0.015, t-statistic 
= 3.05) and the restricted sample (0.013, t-statistic = 2.08). 

Finally, based on the specification used in prior studies, the analyst forecast model does 
not include earnings persistence. However, to maintain consistency with the returns, stock 
rankings and debt ratings specifications, we also examine whether the analyst forecasts 
results are robust to the inclusion of Persistence. When we include EPS, ,*Persistence, 
AEPS, ,* Persistence, and Persistence as additional independent variables, our conclusions 
remain unaffected. The sum of the coefficients on EPS, ,*Tenure and AEPS, ,*Tenure re- 
mains positive and significant for the full sample (0.017, t-statistic — 3.20) and the restricted 
sample (0.016, t-statistic = 2.90). 


VL CONCLUSIONS 
This study provides insights into the recent debate surrounding auditor tenure, inde- 
pendence, and audit quality by analyzing the relationship between auditor tenure and audit 
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quality as perceived by capital market participants. We focus on a market-based approach 
because academics, regulatory institutions, and practitioners frequently emphasize the im- 
portance of incorporating capital market perceptions of independence and audit quality 
(Dopuch et al. 2003; Ryan et al. 2001; SEC 2000; Carmichael 1999; AICPA 1994). Based 
on the association between perceived reliability of financial statements and auditor tenure, 
we infer how capital markets view tenure as affecting audit quality. All else equal, if capital 
market participants perceive auditor tenure as improving (impairing) audit quality, then 
reported financial statements are viewed as more (less) reliable for investment and credit 
decisions. | 

We analyze the perceptions of three primary users of audited financial statements: 
investors, independent rating agencies, and financial analysts. Using earnings response co- 
efficients from returns-earnings regressions as a proxy for investor perceptions of earnings 
quality (Schipper and Vincent 2003), we find evidence consistent with the hypothesis that 
investors perceive earnings quality as improving with auditor tenure. Our analysis of the 
perceptions of independent rating agencies is based on how tenure affects the association 
between rankings/ratings and reported earnings. We find that the influence of reported 
earnings on stock rankings becomes larger with extended tenure, although the influence of 
reported earnings on debt ratings does not vary with tenure. Thus, our results provide 
limited evidence that independent rating agencies view auditor tenure as having a favorable 
impact on earnings quality. Finally, using earnings coefficients from a regression of one- 
year-ahead consensus earnings forecasts on reported earnings as a proxy for analysts' per- 
ceptions of earnings quality, we find evidence consistent with the hypothesis that financial 
analysts perceive earnings quality as improving with extended auditor-client relationships. 

In general, most of our results are consistent with the hypothesis that audited financial 
statements, and in particular reported earnings, are perceived as more reliable for firms with 
longer auditor tenure. One implication of our results is that many capital market participants 
view longer tenure as having a favorable impact on audit quality. Overall, our study suggests 
that imposing mandatory limits on the duration of the auditor-client relationship might 
impose unintended costs on capital market participants. Our results are also subject to a 
caveat. Results from a regime without auditor term limits may not be applicable to a regime 
with mandatory auditor rotation because the incentives for auditors and clients might differ 
between the two regimes. 
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ABSTRACT: This study examines the value relevance of financial statement data and 
nonfinanclal statement information within and across the pre-IPO venture capital and 
post-IPO public equity markets. For a sample of U.S. blotechnology firms, | flnd that 
financial statements are highly value-relevant in the venture capital market, and that 
the signs of the associations between equity values and financial statement data in 
that market are similar to those in the public equity market, despite significant structural 
differences between the two. | also find that the value relevance of financial statements 
generally increases as firms mature, consistent with financial statements capturing the 
Increasing Intensity of assets-in-place relative to future Investment options. In contrast, 
the value relevance of nonfinancial statement information decreases as firms mature, 
indicating that, In a dynamic sense, financlal statements and nonfinancial statement 
information of venture-backed pre-IPO biotech companies are information substitutes 
in valuation, not complements. 
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I. INTRODUCTION 
uch research has shown that financial statements are value-relevant in public eq- 
M» markets (Barth et al. 2001; Kothari 2001). In this study, I break new ground 
by exploring the value relevance of financial statements in private equity mar- 
kets—specifically, in the venture capital market. In tbe process, I shed light on the dynamic 
behavior of young firms’ investment opportunity sets and on the dynamic relations of-(and 
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between) the value relevance of financial statements and nonfinancial statement information 
as firms mature from their founding through three years after they go public. 

The U.S. venture capital market is the part of the aggregate private equity market that 
consists of unregistered equity investments in young, very risky, and technology-oriented 
private firms by venture capital partnerships, corporations, pension funds, and university 
endowments (Gompers and Lerner 2000a). Although GAAP is the same for venture-backed 
private firms and publicly traded for-profit companies, the venture capital and public equity 
markets are structured very differently. The thesis of this paper is that despite these differ- 
ences, common features of underlying firm economics and common GAAP combine to 
create predictable relations between equity values, financial statement data, and nonfinancial 
statement information within and across the pre-IPO venture capital and post-IPO public 
equity markets. 

My first hypothesis is that since U.S. GAAP is designed to apply equally to all for- 
profit businesses, regardless of the market environment within which investors set prices, 
financial statements will, on average, be value-relevant in the venture capital market. 

Second, although venture-backed technology-based firms are smaller, riskier, and more 
intangible-intensive than are the vast majority of firms in public equity markets, they engage 
in fundamentally the same kinds of financing, investing, and operating activities. I therefore 
hypothesize that the direction of the associations between financial statement data and 
equity values will be the same across the two markets—that is, the partial correlations 
between equity values and financial statement data wil] be of the same signs across the two 
markets. 

Third, I hypothesize that financial statements will be value-irrelevant at the beginning 
of a firm's life but will become progressively more value-relevant as the firm matures. 
When a company of the kind that is funded by venture capital is founded, almost all its 
value is attributable to profitable expected future investment opportunities rather than assets- 
in-place. As a result, because GAAP does not allow the value of expected future investment 
options to be recorded in financial statements (only the relevant and reliable consequences 
of past transactions and activities may be included), a firm's financial statements will be 
value-irrelevant when it is founded. However, as it matures, the firm obtains financing, 
makes investments and undertakes operating activities, all of which convert potential in- 
vestment opportunities into actual assets-in-place. Assuming that a maturing firm converts 
its existing investment options into assets-in-place faster than it discovers new ones, then 
its financial statements will reflect greater and greater fractions of its total equity value. 
Hence, as a firm matures its financial statements will become increasingly associated with 
its equity value. 

Finally, I hypothesize that the value relevance of nonfinancial statement information 
will decrease with firm maturity. Nonfinancial statement information is defined as firm- 
specific information that is correlated with future investment opportunities (such as the 
number of patents issued to the firm) and non-firm-specific information that is correlated 
with the profitability of existing investments and/or industry- and economy-wide future 
investment opportunities (such as the level of equity values in the firm's industry). As firms 
mature, not only will the relative importance of assets-in-place and financial statements to 
their equity values increase, but also the relative importance of future investment oppor- 
tunities and nonfinancial statement information to their equity values will decrease. Equiv- 
alently stated, in a dynamic sense financial statements and nonfinancial statement infor- 
mation are predicted to be information substitutes in valuation, not complements. 

The data used to test these hypotheses are a specially constructed set of detailed data 
for an unbalanced panel of 204 U.S. biotechnology companies over the period 1992—2003. 
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Biotech firms are chosen because they are typicallv financed in well-defined stages by 
venture capital and go public quite quickly. The latter characteristic means that financial 
statement data spanning the entire life of the firm are often available in the IPO filing 
documents. My database exploits this by matching the pre-money venture capital valuations 
of a large set of biotech firms with up to five years of annual financial statement data in 
the IPO filings of those firms that filed to go public. This yields 481 pairs of venture capital 
firm valuations that are accompanied by the same firm's prior fiscal year's financial state- 
ments, spanning venture capital valuation points ranging from Series À to Series H financ- 
ing rounds. This dataset is extended into the public equity market by incorporating data as 
of three months after each of the first three years following the IPO (for the subset of IPO 
filers that went public). Post-IPO stock price data are matched with the previous fiscal 
year's financial statements, yielding 449 public market observations. 

Following prior research in entrepreneurial finance (Lerner 1994a; Gompers and Lerner 
1999, 20002, 2000b; Seppä 2003), hedonic log-linear regression models are used to assess 
the nature and strength of the associations between equity values, financial statements, and 
nonfinancial statement information. Two types of regressions are estimated— separate 
pooled regressions for each market, and financing-round-by-financing-round regressions 
within each market. The value relevance of financial statements is defined as the adjusted 
R? that can be uniquely attributed to financial statements in a regression of equity value on 
financial statement and nonfinancial statement information. 

The financial statement variables included in the regressions are the major components 
of a firm's balance sheet and income statement, specifically, the firm's cash balance, noncash 
assets, long-term debt, annual revenues, cost of sales, SG&A, R&D expense, and unrecog- 
nized stock option dilution. Major components of balance sheets and income statements 
are used instead of aggregate book equity and net income in order to avoid the severe 
inferential distortions that can arise when evaluating the value relevance of financial state- 
ments of fast growing, highly intangible-intensive companies (Zhang 2001; Hand 2004). 

The nonfinancial statement information included in the value-relevance regressions con- 
sists of variables that prior research in entrepreneurial finance has found to be correlated 
with the pre-money valuations of privately funded biotech companies, plus two previously 
unexamined factors. The former group consists of the AMEX Biotechnology Index (Lerner 
1994a; Gompers and Lerner 2000b; Seppä 2003), the number and scope of patents filed by 
the firm (Lerner 1994a), the total number of the firm's upstream and downstream strategic 
alliances (Nicholson et al. 2003), an indicator for when the financing round was led by a 
corporate investor (Gompers and Lerner 1999), and indicators that measure the business 
stage the firm is at (Gompers and Lerner 2000b; Seppä 2003). The new factors that I 
examine are the Sand Hill Biotechnology Index of the value of privately held and privately 
financed biotech stocks, and the degree of equity dilution created by the current financing 
round. 

Regression results are consistent with all four of the paper's hypotheses. First, biotech 
firms' financial statements are, on average, value-relevant in the pre-IPO, venture capital 
market. In pooled regressions, financial statements uniquely account for a highly significant 
13 percent of the total regression adjusted R? of 76 percent. Nonfinancial information 
uniquely explains 18 percent, and 46 percent is variation common to both financial state- 
ment and nonfinancial statement data. 

second, the nature of the associations between equity values and financial statements 
in the venture capital market are similar to those observed in the public equity market. In 
both markets, with or without controlling for nonfinancial statement information, firms' 
equity values are positively related to cash, noncash assets, and R&D expense; unrelated 
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to cost of sales; and negatively related to long-term debt and stock-option-related dilution. 
Of the coefficients on the eight financial statement variables included in the pooled regres- 
sions, only those on cash and revenues are reliably different across markets, both being 
larger in public equity markets. 

Third, biotech firms’ financial statements are value-irrelevant at Series A financ- 
ings—the earliest formal venture capital financing round—but become progressively more 
value-relevant as firms mature. Consistent with the proposition that this is due to firms 
converting their future investment options into assets-in-place faster than they discover new 
ones, the ratio of total assets to firm value reliably increases as firms mature. These patterns 
are not a spurious effect of scale or other factors that can impede across-sample comparisons 
of adjusted R? (Gu 2002). 

Fourth, financial statement data and nonfinancial statement information are informa- 
tional substitutes rather than complements. As biotech firms mature and the value relevance 
of their financial statements increases, the value relevance of nonfinancial statement infor- 
mation decreases. The time-series correlation between the value relevance of financial state- 
ment data and the value relevance of nonfinancial statement information is reliably negative. 

Last, I find that the value relevance of financial statement data and nonfinancial state- 
ment information are quite different at the IPO than expected from the maturity-based trends 
noted above. At the IPO, the value relevance of financial statement data dips temporarily 
below the maturity trendline, while the value relevance of nonfinancial statement infor- 
mation rises temporarily. Consistent with arguments made by finance research into other 
IPO pricing anomalies, I attribute this to the notion that IPOs are sold by sophisticated 
underwriters to unsophisticated retail investors who underweight the value relevance of 
financial statement data and overweight the value relevance of nonfinancial statement 
information. 

In total, this paper contributes to several literatures. It expands the value-relevance 
literature by finding that financial statements in the pre-IPO venture capital subsector of 
the private equity market are qualitatively and quantitatively value-relevant in economically 
reasonable ways. This indicates that the concepts underlying U.S. GAAP are broad and 
robust. As intended by the FASB, GAAP “‘provide[s] information that is useful to present 
and potential investors ... in making rational investment, credit, and similar decisions" 
(FASB 1978, Concepts Statement No. 1, para. 34), regardless of whether the market in 
which investment decisions are made is private or public. This robustness exists despite 
large differences in price-setting mechanisms, the importance of assets-in-place versus fu- 
ture investment opportunities, and the level of regulation, liquidity, uncertainty, and infor- 
mation asymmetries between venture and public equity markets. 

The paper also contributes to the accounting literature on the investment opportunity 
set. Prior work has found that in the cross-section, firms' investment opportunities are 
important determinants of many corporate decisions (Smith and Watts 1992; Gaver and 
Gaver 1993; Skinner 1993). This study demonstrates that not only do firms' investment 
opportunity sets decline as they mature, but also the decline is predictably related to both 
increasing value relevance for financial statements and decreasing value relevance for non- 
financial statement information. The increasing value relevance of financial statements as 
young firms mature also bookends the decreasing value relevance of financial statement 
data found by Barth et al. (1998) for firms approaching the ends of their lives in the form 
of sharply declining financial health and/or bankruptcy. 

With respect to the accounting literature on the role of nonfinancial information in 
valuation, this paper contributes new knowledge beyond the dynamic relations observed 
between firms' investment opportunities and the value relevance of nonfinancial statement 
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information. Specifically, although I concur with prior studies such as Amir and Lev (1996), 
Ittner and Larcker (1998), Deng et al. (1999), and Trueman et al. (2000) in their conclusions 
that nonfinancial information can be value-relevant, I find that for biotech firms, nonfinan- 
cial statement information is only materially value-relevant prior to filing for an IPO. In 
the post-IPO market, nonfinancial statement information seems to be almost entirely value- 
irrelevant once the value relevance of financial statements is controlled for. Moreover, I 
find that financial statement data and nonfinancial statement information are substitutes, not 
complements, in valuation. This runs counter to the findings of Amir and Lev (1996). 
Finally, this paper contributes to entrepreneurial finance, which has not previously ex- 
amined the relevance of financial statements. I show that financial statements are highly 
related to pre-IPO firm equity values. I also show that when venture capital valuations are 
pooled over all pre-IPO financing rounds, nonfinancial statement information of the type 
examined previously by venture capital researchers still has explanatory power once finan- 
cial statement data are controlled for. I further identify two new value-relevant items of 
nonfinancial statement information—the level of biotech company valuations in the entire 
venture capital market, and the degree of new equity dilution created by the current round 
of funding. The results regarding dilution strongly suggest that the demand for equity is 
far from perfectly elastic in the venture capital market. In addition, not controlling for 
financial statements leads to the erroneous conclusion that the value relevance of nonfinan- 
cial information does not change as the firm matures, when, in fact, it declines sharply. 
The remainder of the paper proceeds as follows. Section II describes the institutional 
background of the U.S. private equity market, particularly venture capital, and provides a 
sketch of biotechnology and business research into biotech firms. Section III develops the 
value-relevance hypotheses. Section IV explains the sample selection, describes the database 
employed, and reports descriptive statistics. Section V motivates the regression models that 
are used. Section VI reports regression results along with the findings from various ro- 
bustness diagnostics and notes the key limitations of the study. Section VII concludes. 


II. THE VENTURE CAPITAL MARKET AND BIOTECHNOLOGY 

Private Equity and Venture Capital 

There are four types of private equity: organized, angel, informal, and Rule 144A (Fenn 
et al. 1995).! Venture capital is the subpart of organized private equity that invests in young, 
typically high-technology, firms. Venture capital funds are usually structured as partnerships 
that raise money in staged amounts from companies, institutions, and wealthy private in- 
vestors. À fund generally has a ten-year life and invests in a portfolio of private companies, 
often focused in one or two sectors such as biotechnology or software (Sahlman 1990). 
Although venture capitalists put up only about 1 percent of the limited partnership's capital, 
they manage the fund through their role as the general partners. In exchange for finding, 
screening, and deciding upon which companies to invest in, they are paid an annual man- 
agement fee that is usually between 1.5 percent and 3 percent of the fund's committed 


t Organized private equity is unregistered investments in the equity of private companies, made either directly by 
professional investors such as corporations, pension funds, and endowments or indirectly by these investors 
through intermediaries, particularly venture capital partnerships. Ángel private equity consists of investments by 
wealthy individuals, typically arranged by matchmakers such as lawyers and accountants (Wong 2002). Informal 
private equity is similar to angel capital except that firms sell unregistered equity securities to both institutional 
investors and wealthy individuals across a larger number of such investors. Rule 144A private equity is private 
equity offerings underwritten under the SEC’s Rule 144A, which establishes the rules and conditions under 
which private securities are permitted to be traded among certain classes of institutional investors (Fenn et al. 
1995). 
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capital or net asset value, and receive about 20 percent of the profits made by the fund's 
investments. The typical investment made by a venture fund in a company is in the form 
of illiquid preferred stock that only converts into liquid common stock or cash at one of 
two major exit points—an IPO, or the sale of the company to another entity. Although 
venture funds provide a firm with a measure of long-term financing through their invest- 
ments in several financing rounds, they also supply business expertise and connections. A 
partner from the venture fund usually serves on the firm's board of directors; provides 
financial sophistication, operating services, and business contacts; helps recruit key person- 
nel; and imparts financial and strategic discipline. 


Distinctives of the Venture Capital Market 


The venture capital market differs from the public equity market in its institutional 
structure and the type of firms that it finances (Wright and Robbie 1998; Gompers and 
Lerner 2000a). In terms of the number, valuation, and diversity of firms involved, the 
venture capital market is much smaller than is the public equity market. Whereas the public 
equity market is highly regulated by the SEC and stock exchanges, the venture capital 
market is not. Firm valuations in the venture capital market are set through face-to-face 
negotiations between management and a small number of wealthy, professional, and risk- 
tolerant investors. Valuations in the public equity market are set anonymously without direct 
contact with management via the interactions of large numbers of investors, many of whom 
are risk-averse and do not have significant wealth or professional investing experience. 
Unlike in public markets, the valuations of individual venture-backed firms are difficult and 
costly for nonparticipating investors to observe. Trading is cheap and all but continuous in 
the public equity market, but it is expensive and rare in the venture capital market. The 
trading that does occur in the venture capital market is almost entirely in the form of long 
investments in new equity, not secondary transactions in existing shares. Venture capital 
investments are also far less divisible than are public equities. Short selling is feasible and 
frequent in the public market, but nonexistent in the venture capital market. Venture capital 
investors are more likely to be able to extract management's private information because 
of the board positions they hold and their frequent interactions with and monitoring of 
management. In contrast, public equity investors typically engage in indirect monitoring, 
rely primarily on public data, and face significant information asymmetries. In total, the 
venture capital market is illiquid, incomplete, and imperfect—to a much greater degree than 
is the public equity market. 


Research into Venture-Backed Companies 


Research into venture capital has blossomed over the past decade. An excellent sum- 
mary is provided by Gompers and Lerner (20002), spanning topics such as the compen- 
sation of venture capitalists (Gompers and Lerner 1999); the optimal investment, monitor- 
ing, and staging of venture capital (Gompers 1995); the decision to go public (Lerner 
1994b); and the long-run performance of venture-backed IPOs (Brav and Gompers 1997). 
Most recent work includes that of Kaplan and Stromberg (2003, 2004), who empirically 
relate the characteristics of venture capital contracts to theories of financial contracting, and 
Seppä (2003), who examines the relations between investment syndication and the effi- 
ciency of venture capital firms, as well as the role of network positions in determining the 
performance of venture capital firms. 
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Biotechnology and Biotechnology Firms 


Biotechnology is the application of technology to the life sciences, wherein living cells 
or their processes are used to solve problems and perform specific industrial or manufac- 
turing processes. Biotech applications include the production of drugs, synthetic hormones, 
and bulk foodstuffs; the bioconversion of organic waste; and the use of genetically altered 
bacteria. Biotech firms are highly dependent on the intellectual property (ideas, discoveries, 
patents) generated through their large R&D expenditures and, as such, are among the most 
intangible-intensive of businesses. The value chain of the typical biotech firm stretches 
some 10—15 years from founding through patenting to successful FDA approval and product 
sales (see Figure 1). Biotechnology is therefore a very risky but potentially very lucrative 
equity investment. 

The prototvpical young biotech firm is in an intense R&D race against competitors to 
discover and patent a new drug. As such, it has large capital needs over a long period of 
time. In the early stages of life, the firm's capital needs are met by venture capital and 
strategic equity investments from pharmaceutical companies. However, capital needs typi- 
cally become so large that they can only be satisfied through an IPO or a buyout by a large 
pharmaceutical company. Successful biotech firms therefore tend to go public rapidly, and 
it is not uncommon for a biotech fixm's S-1 filing with the SEC to contain financial state- 
ments that span its entire life. 

Economic research into biotechnology has spanned three areas: intellectual capital 
(Zucker and Darby 1996; Zucker et al. 1998), strategic alliances (Robinson and Stuart 
2002), and valuation (Stuart et al. 1999; Nicholson et al. 2003; Darby et al. 1999). In 
accounting, Joos (2002) finds that the level and rate of growth in R&D expense, R&D 
success, and competitive structure explain cross-sectional variation in market-to-book ratios 
for pharmaceutical drug manufacturers. Ely et al. (2003) conclude that the average FDA 
stage of a firm's portfolio of drugs conditions the value relevance of the firm's R&D 
expenditures. Guo et al. (2004) find that biotech firms' disclosures affect their bid-ask 
spreads and stock return volatility. Hand (2004) reports that the mapping of publicly traded 
biotech firns' R&D expenditures into equity market value is a function of the location of 
R&D in the biotech value chain of discovery, development, and commercialization, as well 
as the growth rate in R&D spending. 


HI. HYPOTHESES 

The goals of this paper are to develop and test hypotheses about the value relevance 
of financial statement data and nonfinancial statement information within and across the 
venture capital and public equity markets. The thesis of the paper is that despite significant 
economic and structural differences between the two markets, common features of under- 
lying firm economics and common GAAP combine to create predictable relations between 
equity values, financial statement data, and nonfinancial statement information. 

The first hypothesis stems from the fact that the FASB's Conceptual Statements and 
Issued Standards are designed to be useful to investors in all types of for-profit businesses, 
not just publicly traded entities. Although firms in the venture capital market have more 
concentrated ownership and are smaller, riskier, and more intangible-intensive than firms 
in the public market, and although it has been argued that U.S. GAAP was developed for 
tangible-intensive businesses and is therefore less well suited to intangible-intensive tech- 
nology firms (Lev and Zarowin 1999; Lev 2001), it seems unlikely that financial statements 
would be entirely without value relevance for all companies at all points prior to filing for 
an IPO. This is because all firms in both markets engage in financing, investing, and op- 
erating activities—only on different scales and to different degrees. The first hypothesis is 
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FIGURE 1 
Biotechnology Value Chain 
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Source: Ernst & Young LLP, Biotechnology Industry Report Convergence , 2000. 


Discovery and Preclinical Testing: The drug development process usually begins with the scientific 
discovery of a gene or other biological pathway involved in a disease. Discovery can take 2—10 years. From 
discovery, a target for therapeutic intervention is established. Preclinical tests are conducted in the lab using 
individual cells or sometimes animals to evaluate the safety and potential for effectiveness in humans. If the 
target is determined to be legitimate, the company files an Investigative New Drug (IND) application with the 
FDA for clearance to begin testing on humans. Even after these first few years of research and testing, however, 
most new drug candidates will never make it to the market. 

Phase I Trials: Human testing begins. The purpose of a Phase I trial is to use a small number of volunteer 
patients to establish basic safety and maximum dosage parameters. 

Phase II Trials: This stage of clinical study is much more involved, requiring many months to plan, set up, 
and recruit trial participants. Phase H is conducted on a larger group of patients with the targeted disease to 
study the efficacy of the drug at various doses and confirm its safety. They typically use blinding and placebo 
controls to achieve scientifically sound results. Phase II often lasts two years, and sometimes a drug will 
undergo multiple Phase II trials for different indications (for example, to treat different types of cancer). This 
may be the most critical phase in terms of sorting winners from losers. As a rule of thumb, drugs that complete 
Phase If and move on to Phase III have about a 50 percent success rate of reaching the market, though some 
studies suggest the rate is higher. 

Phase III Trials: These tests are designed with a specific endpoint—a measurable result that clearly 
demonstrates success in combating the targeted disease. The endpoint must be agreed upon by the FDA as an 
outcome that will lead to marketing approval. The trial involves a large group from the targeted patient 
population and uses controls such as double-blinding (neither patient nor doctor knows who is getting a 
placebo). Multicenter trials are common to show that results are reproducible when administered in different 
clinical settings. This pivotal phase often lasts two to three years from initial design to study completion, and 
here again it is common for drugs to undergo more than one Phase III trial for different indications ór to support 
different therapy combinations. 

FDA Approval Process: If a drug successfully completes Phase III, then the company gathers all of its 
clinical data and files an application for marketing approval with the FDA. It often takes three to six months just 
to prepare the application. Another six to 12 months can pass before an FDA advisory panel reviews the 
application and makes a recommendation. This advisory panel has expertise in the drug's specific area of 
therapeutic or disease characteristics, and its recommendation for denial or approval is normally followed by the 
FDA (though another six to 12 months can pass before that happens). 

Modified from an article by James Hale, available at: 
http: // www.theonlineinvestor.com/ industries.phtml?content-- is... bio2. 
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therefore that the financial statements will on average be value-relevant in the venture 
capital market. 

The second hypothesis extends this line of reasoning. Because U.S. GAAP does not 
depend on whether a for-profit firm is privately held or publicly traded, I hypothesize that 
the nature of the associations between equity values and financial statement data in the 
venture capital market will accord with economic intuition and will be the same as those 
in the public market. That is, I predict that the partial correlations between equity values 
and key financial statement items will have economically sensible signs, and that the signs 
will be the same across markets. For example, the GAAP definition of noncash assets leads 
to the prediction that the partial correlation between firms' equity values and noncash assets 
will be positive, not negative, but that the converse will be true for a liability such as long- 
term debt. 

The third hypothesis is that financial statements will be value-irrelevant at the beginning 
of a firm's life, but will become progressively more value-relevant as the firm matures. As 
highlighted by Miller and Modigliani (1961), the value of a firm's equity can be broken 
into two parts: the present value of net assets-in-place as a result of past investments, and 
the present value of future profitable investments or growth opportunities. Growth oppor- 
tunities are the real options that a firm has or may create to make future investments that 
earn a rate of return in excess of its opportunity cost of capital (Myers 1977). Growth 
opportunities are frequently referred to as the firm's "investment opportunity set" and have 
been found to be an important determinant of many corporate decisions (Adam and Goyal 
2003)? 

When a firm is founded, its value consists almost exclusively of ideas the founders 
have for profitable future investments—viz., investment/growth opportunities—rather than 
on-balance-sheet assets-in-place (Myers 1977). As the firm matures, its investment oppor- 
tunities are financed and converted into tangible and intangible assets and liabilities re- 
corded in financial statements through double-entry bookkeeping (e.g., cash, PP&E, R&D, 
trademarks, debt). One result of this process is that the fraction of the firm's equity value 
attributable to assets-in-place increases relative to that which is attributable to future in- 
vestment opportunities. This view is consistent with the conventional start-up/growth/ 
maturity /decline model of the lifecycle of a firm (Smith et al. 1985; Black 1998).* 

U.S. GAAP, which applies equally to private and public for-profit firms, does not allow 
the value of expected future investment opportunities to be recorded in financial statements, 
only the relevant and reliable assets and liabilities associated with past investments. This 
means that as a firm matures and the fraction of value attributable to its assets-in-place 


? For example, the relative mix of assets-in-place versus future investment opportunities affects a firm's accounting 
policies (Skinner 1993), capital structure (Myers 1977; Smith and Watts 1992), compensation contracts (Smith 
and Watts 1992; Gaver and Gaver 1993), and dividend policy (Smith and Watts 1992). 

3 Although it is possible that a maturing firm could maintain a nondecreasing ratio of value from assets-in-place 
relative to value from future investment opportunities, this would require the firm to continually create new and 
increasingly large future investment opportunities. In expectation, this is economically unlikely. 

* [n the start-up phase there are few assets-in-place and the largest fraction of a firm's value stems from its ideas, 
intellectual property, and growth opportunities. In the growth phase, financing has been obtained, investments 
have been made, and operating activities are underway. Although the fraction of firm value attributable to assets- 
in-place is higher than in the start-up phase, growth opportunities remain large and near-term financing require- 
ments remain substantial. In the maturity phase of the life cycle, the firm's growth has slowed substantially, its 
assets-in-place dominate its valuation, and its cash from operations is strong enough that it does not need external 
financing. Finally, in the stagnant or decline phase, the firm is either in a no-real-growth steady state where its 
earlier abnormal profits have all been competed away, or the firm is destroying shareholder value in its invest- 
ments and operations, making it a candidate for restructuring, a takeover, or liquidation (Smith et al. 1985). 
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increases relative to that of its future investment opportunities, the firm's financial state- 
ments will reflect a greater and greater fraction of the firm's value. A key manifestation of 
this linkage will therefore be that the value relevance of a firm's financial statements will 
increase as the firm matures. 

My fourth hypothesis stems from the observation that financial statements are not the 
only source of value-relevant information. Specifically, I propose that as a firm matures, 
the value relevance of its nonfinancial statement information will decrease. Nonfinancial 
statement information is defined as firm-specific data that are correlated with future in- 
vestment opportunities (e.g., the number of patents issued to a firm) as well as non-firm- 
specific information that is correlated with the profitability of existing investments and/or 
industry- and economy-wide future investment opportunities (e.g., the level of equity values 
in a firm's industry). As firms mature and convert investment opportunities into assets-in- 
place, not only will the relative importance to equity values of assets-in-place and financial 
statements increase, but also the relative importance of future investment opportunities and 
nonfinancial statement information will decrease. That is, I predict that in a dynamic sense 
financial statements and nonfinancial statement information will be informational substitutes 
in valuation, not complements. It should be noted that this fourth hypothesis does not 
automatically follow if the third hypothesis is true. It could logically be the case that the 
value relevance of both financial statement data and nonfinancial statement information 
both increase as firms mature. 


IV. DATA 

This section describes the construction and characteristics of a database that combines 
private equity valuations, financial statement data, and nonfinancial statement information. 
The strengtlis of the database are that it is large, not unduly clustered in time, and rich in 
different kinds of information. It contains a wide variety of longitudinal financing points 
(Series A through Series H financing rounds, as well as many nonfinancing public valuation 
points) made by firms that either successfully or unsuccessfully tried to go public. The 
database has two main limitations. It does not contain venture capital valuations for firms 
that did not file to go public because they went bankrupt, merged, or chose to remain 
private. If financial statement data are more value-relevant for companies that were suc- 
cessful in that they filed to go public, then there may be an upward bias in the measured 
pre-IPO value relevance of financial statements. This concern is addressed in Section VI. 
Also, all of the IPOs from which financial statement data are taken were filed between 
9/29/95 and 12/14/01, which is only approximately a six-year period. 


Equity Valuation Data 

Venture capital firm valuations were purchased from Recombinant Capital (Recap, visit 
http: //www.Recap.com), a West Coast consulting firm that specializes in collecting and 
selling information on biotechnology companies. Recap has gathered what it indicates is a 
full set of round-by-round financings for each of more than 600 biotech companies begin- 
ning in the early 1980s. Recap's equity valuation histories come mainly from IPO filing 
documents and SEC filings by companies that acquire biotech firms.? The valuation histories 


5 The data items in Recap's valuation file consist of company name; the type of financing round (e.g., Founding, 
Series A, IPO, etc.); the category of the round (e.g., Founder/Insider, Venture, Corp. Private, etc.); the date the 
round was filed (that is, begun); the date the round was concluded; the amount of dollars raised in the round, 
if any; the price per share of that round’s financing; the post-money valuation; the number of shares involved 
(although this is infrequent); and the names of up to three investors, with tbe first investor being the lead investor 
in the round. 
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are based on primary preferred and common share issuances from the date of founding 
until the most recent equity financing or other share issuance (which may be after a suc- 
cessful IPO). The valuation date is the date the terms of the financing round were agreed 
on by the firm and the venture investors. Preferred shares are converted into common at 
the ratios specified in the financing agreements. 

Post-IPO firm equity values were obtained from standard online sources. Three post- 
IPO valuation dates were used, namely three months after the first, second, and third fiscal 
year-ends following the IPO offering date (if an IPO occurred, since not all biotech firms 
in the sample that filed to go public successfully accomplished an offering). 


Financial Statement Data 


For the subset of firms in Recap's database that filed to go public, pre-IPO financial 
statement data were collected from S-1 and 424B documents when they were available 
online at http://www.sec.gov. This exploits the fact that when a firm files to go public, it 
has to provide five years' worth of audited (albeit coarse) historical financial statement 
data.? Post-IPO financial statements were taken from 10-Ks. 

In the pre-IPO part of the total dataset, on a firm-by-firm basis, each year of pre-IPO 
financial statement data was (where possible) matched with the first—and only the 
first—venture capital pre-money valuation following the fiscal year-end, providing that the 
valuation date was less than a year after the fiscal year-end.’ No set of annual financial 
statements was matched with more than one pre-money valuation. This process yielded 481 
pairs of pre-money venture capital valuations for which there were financial data from the 
preceding fiscal year, covering a total of 193 firms. To be considered usable, a firm's cash 
balance, SG&A expense, and R&D expense had to be positive.* In the post-IPO part of the 
database, for each biotech firm that successfully went public, its equity market value was 
(where possible) noted three months after each of the first three fiscal year-ends after its 
IPO. These equity market values were then matched with the preceding fiscal year's finan- 
cial statement data, yielding 441 pairs of public market valuations for which there were 
financial data from the preceding fiscal year, covering a total of 160 firms. 

The financial statement items included in the database were three primary components 
of total shareholder equity— cash, noncash assets, and long-term debt—and the main com- 
ponents of biotech firms’ net income, namely annual revenues, cost of sales, SG&A costs, 


$ I am unaware of any other systematic, large-sample source of financial statement data for pre-IPO companies. 
Venture capital funds contacted during the course of this research indicated that they keep imperfect historical 
records of the financial statements of companies they invest in. Noting that "accounting data for private firms 
is unavailable," Gompers (1995) uses SIC industry averages from Compustat for each venture capital-funded 
firm in his analysis of the determinants of the duration of, and funding amounts provided to, venture capital- 
backed firms. 

7 For example, if a firm with a calendar year-end filed to go public on 11/5/98, and had Series A, B, and C 
rounds on 1/6/96, 10/27/96 and 12/5/97, respectively, the Series A and Series C would be usable, but the 
Series B would not because it is the second financing in 1996. In my design, only the first financing in 1996 
can be matched with financial statements for the year ended 12/31/95 because it not possible to know how the 
financing on 1/6/96 affected financial statements as of 10/27/96. The strength of this approach is that no 
private equity market valuations are matched to what I will call ‘financing contaminated” financial statements. 
The limitation is that the gap between the valuation financing date and the preceding fiscal year end can be as 
little as a few days and as Jong as almost a year. However, the noise created by this weakness is unlikely to 
create regression biases in favor of my hypotheses. 

* A biotech firm that bas no cash, or no SG&A, or no spending on R&D, is highly unusual. Approximately 2 
percent of the initial observations were deleted as a result of these restrictions. 
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and R&D expense.? Long-term debt includes capitalized lease obligations. Revenue is the 
sum of collaborative, contract, grant, license, research, and product revenues. In addition 
to these recognized numbers, the degree of stock option dilution was included, defined as 
the number of shares under option divided by the number of shares outstanding. Stock 
option dilution Is a proxy for economic cost of stock options that is almost never recognized 
in firms' financial statements. Stock option data are sometimes missing because firms are 
only required to provide three years of pre-IPO filing stock option data in their S-1 or 
424B-4 filings. To account for this, an indicator variable is included in the regressions and 
is set equal to 1 if stock option data are missing, and O otherwise. Missing stock option 
dilution observations are then reset to 0. 

The GAAP definitions of assets and liabilities lead to the prediction that in a regression 
of equity values on financial statement data, the coefficients on cash and noncash assets in 
both venture capital and the public equity markets will be positive, while the coefficients 
on long-term debt and stock option dilution will be negative. For income statement varia- 
bles, I expect to observe a positive coefficient on revenues to the extent that revenues are 
not entirely transitory, a positive coefficient on R&D because the bulk of the benefits from 
R&D emerge in future periods, and a negative coefficient on cost of sales.!? I make no sign 
prediction on SG&A, because SG&A is a mixture of period expenses that would be ex- 
pected to be negatively related to equity value, such as the rent on the firm's facilities, and 
certain costs that provide future benefits, such as salaries for senior management and key 
scientific personnel, which would be expected to be positively related to equity value due 
to their multiperiod nature. 

The regressions that I estimate characterize firms' equity values as a function of the 
major components of their book equity and net income rather than their book equity and 
net income per se, because young biotech firms make highly intensive and rapidly growing 
investments in R&D and therefore have significantly biased financial statements. Zhang 
(2001) demonstrates analytically that accounting conservatism combined with rapid growth 
in intangible assets can dramatically distort the associations between aggregate financial 
statement data and equity value. If the firm's investments in intangible assets are sufficiently 
intense, reported losses will be negatively, not positively, associated with equity values. 
Hand (2003) confirms Zhang's (2001) predictions for intangible-intensive Internet compa- 
nies and demonstrates that the distorted relations between equity values and aggregate 
financial statement data for intangible-intensive firms can be avoided by replacing book 


° By focusing on major components, this approach omits minor components. On the balance sheet, total assets 
= cash + noncash assets = total liabilities + convertible preferred stcck + stockholders' equity. Of total 
liabilities, I include only long-term debt (defined as including long-term notes payable) because current liabilities 
are not identified in the five-year historical summary of coarse financial statement data in S-1 filings. If the sum 
of current liabilities and noncurrent liabilities outside of long-term debt is estimated from the data that are 
available, and is included in the value-relevance regressions, no inferences are changed. In biotech firms' income 
statements, net income — revenues — cost of sales — SGA — R&D - interest income — interest expense — 
income tax expense +/— miscellaneous items. Interest income and interest expense are excluded from the value- 
relevance regressions because they are directly and causally related to firms' cash holdings and long-term debt. 
Income tax expense is omitted because it is rarely present, as are miscellaneous items. Net income attributable 
to common shareholders — net income — accretion to redemption value and/or accrued dividends related to 
beneficial conversion features of preferred stock. The latter variable is not included in the regressions because 
the corresponding preferred stock is effectively included in the selection of the balance sheet items cash, noncash 
assets, and long-term debt (since cash + noncash assets — long-term debt = current liabilities + long-term 
liabilities outside of long-term debt + preferred stock outside of stockholders' equity + stockholders' equity). 

10 Because cost of sales is only recorded for product sales, and young biotech firms typically earn revenues from 
several other sources before they are mature enough to generate product sales (collaborations, contracts, grants, 
licenses, and research), few firms in the venture capital market portion of the database reported cost of sales. 
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equity and net income with their key components, such as the major categories of assets, 
liabilities, revenues, and expenses. This prevents the associations between intangible assets/ 
expenses and equity value from contaminating the associations between tangible 
assets/expenses and equity value. 


Nonfinancial Statement Information 


A substantial amount of nonfinancial statement information was collected for this study 
in order to include in the value-relevance regressions many of the variables that prior work 
has found to be value-relevant for venture-backed biotech firms. Beyond these, I also ex- 
plore the value relevance of two new nonfinancial statement factors. The nonfinancial state- 
ment information that was collected comprises the following items: 


(1) The level of the AMEX Biotechnology Index on the valuation date. Lerner (19942), 
Gompers and Lerner (2000b), and Seppá (2003) find that biotech firms' pre-money 
valuations are larger the higher are overall publicly traded biotech equity values, 
because overall publicly traded biotech equity values provide a strong indication 
of the economic prospects of the biotech sector as a whole.!! 

(2) The age of the firm. Gompers and Lerner (1999), Nicholson et al. (2003), and 
Seppä (2003) find that the older biotech firms have larger pre-money valuations. 
They attribute this to older firms being less risky. 

(3) The number and scope of the patents filed by the firm on the valuation date. Lerner 
(19942) argues that intellectual property is a young biotech company's most val- 
uable asset. To test this proposition, he computes the number of patents issued to 
a firm at the valuation date and develops a proxy for patent scope using the Inter- 
national Patent Classification scheme.'* He finds that both the number and scope 
of biotech firms' issued patents are reliably positively associated with pre-money 
valuations. Lerner (1994a) also develops a proxy for patent uniqueness, which for 
parsimony I do not include in this study. I adopt the same general approach as 
Lerner (1994a), except that I use patents filed, rather than patents issued, as of the 
valuation date. Results are not sensitive to this distinction, but conversations with 
venture capitalists and patent lawyers specializing in biotechnology indicate that 
the average lag of about three years between when a patent is first filed and when 
it is issued leads venture capitalists to significantly weight patent filings. Patent 
data were extracted from http: // www.uspto.gov. 

(4) The total number of the firm's upstream and downstream strategic alliances. Con- 
sistent with the view that alliances with pharmaceutical companies are credible 
endorsements of a biotech company's prospects, Nicholson et al. (2003) report that 
biotech companies that sign alliance deals with pharmaceutical companies (so- 
called “downstream” deals) are valued significantly higher by venture capitalists. 
I expand this type of analysis to include "upstream" deals, such as that between 


!! The Amex Biotechnology Index (BTK) is an equal-dollar-weighted index designed to measure the performance 
of a cross-section of companies in the biotechnology industry that are primarily involved in the use of biological 
processes to develop products or provide services. Such processes include, but are not limited to, recombinant 
DNA technology, molecular biology, genetic engineering, monoclonal antibody-based technology, lipid/liposome 
technology, and genomics. The BTK Index was established with a benchmark value of 200 on October 18, 
1991. It is rebalanced quarterly based on closing prices on the third Friday in January, April, July, and October 
to ensure that each component stock continues to have approximately equal weight in the index. 

12 For an explanation of the International Patent Classification scheme, and how it differs from the U.S. classifi- 
cation scheme, see Lerner (1994a, 320—321). 
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a biotech firm and a university. I predict that the greater the number of a firm's 
total alliances, the higher will be its equity value. Alliance data were obtained from 
http: // www.recap.com. 

An indicator for whether a private equity financing round was led by a corporate 
investor. Gompers and Lerner (1999) find that corporate investors assign higher 
pre-money valuations when investing in private equity financings than do venture 
capital funds, either because corporations perceive there to be strategic synergies 
available, or because corporations typically engage in less monitoring and business 
development than do venture capital funds and therefore earn a lower expected 
return on their investment. Data on whether a venture capital financing round was 
led by a corporate investor were obtained from http: // www.recap.com. 

Indicator variables that measure the business stage of the firm. Gompers and Lerner 
(1999, 2000b) and Seppä (2003) control for firm maturity using firm age and either 
the round number (e.g., first, second, third, etc.) or the firm's business stage (start- 
up, development, beta, shipping, profitable, or restart). Unlike the VentureOne da- 
tabase used in most prior studies, the Recap database codes the Series of the fi- 
nancing round—whether the round is Series A, Series B, etc.—rather than the 
firm's business stage. Although I do not make predictions on the signs of Series 
indicators, they play a important role in mitigating the possible selection bias aris- 
ing from the fact that only firms that file for an IPO are included in the sample. 
This is because selection bias is likely to be more severe across rounds than within 
rounds. Financing round indicators also help control for discount rates that are 
expected to decline as firms mature. 


Beyond these, Í also examine the value relevance of two new nonfinancial statement 
factors: 


The Sand Hill Biotechnology (SHB) Index of the value of privately held and fi- 
nanced biotech stocks. The SHB Index is a proxy for the biotech industry's overall 
prospects, separate from that of the AMEX Biotech Index of publicly traded biotech 
stocks. I therefore expect it to be positively related to equity values. As described 
by its developer, Sand Hill Econometrics of Palo Alto, the SHB Index is a propri- 
etary, unbiased index of the venture capital value of pre-IPO privately funded bio- 
tech companies. The SHB Index had a benchmark value of 100 in January 1989. 
Using bias-corrected data on company values, Sand Hill Econometrics builds the 
Index with tools akin to those used for repeat-sales indices of houses and other 
assets.!3 


(8) The degree of equity dilution created by the current financing round, defined as 


the number of shares issued to new investors divided by the number of pre-money 
shares outstanding. This variable measures the degree to which the venture capital 
market is imperfect in that the demand for equity is less than perfectly elastic. A 
negative coefficient on dilution implies that demand for the firm's equity is less 


P According to Sand Hill Econometrics, in dollar terms the Index covers virtually all private biotech companies 
in the United States that have raised money from outside investors since 1987. The valuation events used to 
construct the Index include funding rounds, IPOs, acquisitions, liquidations, and shutdowns. The Index addresses 
the measurement problems unique to venture investing, namely intermittent pricing and selection bias. The Index 
is built from individual company data, not venture-fund data, because company-level data allow for the mea- 
surement of value at a point in time and for statistical correction for selection bias. Sand Hill Econometrics' 
methods correct for bias in the reporting of value for known deals and for bias in the likelihood of companies 
falling out of available data sources. Companies doing poorly are less likely to report value and more likely to 
be left out of subsequent systematic capture of funding round or shutdown information. 
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‘than perfectly elastic because the price at which new investors are willing to invest 
is decreasing in the number of new shares to be issued. Venture capital markets 
may be imperfect in this way because the availability of strong substitutes for any 
given equity investment are more limited than in public equity markets, given the 
much smaller size of the venture capital market and the very illiquid nature of 
venture capital investments. Prior work by Gompers and Lerner (2000b) and Seppá 
(2003) has also examined whether venture capital valuations were affected by de- 
mand, using the total amount of funding committed to U.S. venture capital funds 
in the 12 months leading up to the valuation date. Both studies found evidence that 
inflows of capital into venture funds increase the valuations of funds’ new invest- 
ments; that is, firm valuations get inflated because “too much money is chasing 
too few deals." The advantage of using dilution created by the current financing 
round instead of annual inflows of capital into venture funds as a proxy when 
testing for less than perfectly elastic demand is that dilution created by the current 
financing round is firm-specific. If the demand for equity in the venture capital 
market is indeed less than perfectly elastic, then the dilution created by the current 
financing round will be negatively associated with pre-money valuations. Moreover, 
if dilution created by the current financing round is a superior proxy, it should also 
statistically dominate the association between equity value and annual inflows of 
capital into venture funds. 


Descriptive Statistics 

Table 1 provides descriptive statistics on the major financial statement and nonfinancial 
statement information items for the sample biotech firms. Private equity financings are 
evenly spread over the ten-year period 1992-2001, and public market valuation dates over 
the eight-year period 1996—2003 (Panel A). The distribution of valuation observations 
across the venture capital and public equity markets is such that the vast majority of sample 
firms are represented in both the venture capital market and the public equity market da- 
tasets, rather than only one of the two (Panel B). Of the total sample of 204 firms, 193 
have at least one valid valuation in the venture capital market, and 160 have at least one 
valid valuation in the private equity market. Also, 85 percent (136/160) of firms in the 
private equity market dataset have valuations for each of the maximum of three post-IPO 
years, while 82 percent (158/193) of firms in the venture capital market dataset have two 
or more valuations. 

Most biotech firms file to go public quite rapidly, mainly between four and six years 
after they were founded (Panel C). The IPO filing date for the median firm occurs during 
preclinical testing and/or Phase I trials (see Figure 1), although a few firms take much 
longer to file to go public. The majority of sample companies are in pharmaceutical prep- 
arations, SIC 2834, or commercial, physical, and biological research, SIC 8731 (Panel D). 
Despite this concentration, sample firms comprise a total of 22 different four-digit SIC 
codes. Of firms, 47 percent had their headquarters at the time of the IPO filing in California 
and 14 percent were in Massachusetts (Panel E). Headquarters clustering arises because 
California and Massachusetts contain many top universities with star scientists on their 
faculties (Zucker et al. 1998). 

Table 2 reports the median values of various financial statement and nonfinancial var- 
iables in private and public markets on a round-by-round basis. Table 2 is organized from 
left to right in the typical sequence followed by a firm funded by private equity, beginning 
with the Series A financing. The number of observations at a given private equity round 
differs from the number of firms in the sample because not all firms file to go public after 
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their Series G or H round; some go public as early as after their Series A round. In addition, 
a given set of annual financial statements is only ever matched to one financing round. For 
example, if a calendar-fiscal-year firm had a Series B financing on 1/10/96 and a Series 
C financing on 12/28/96, only the Series B valuation—matched to the 12/31/95 financial 





TABLE 1 
Descriptive Statistics on 481 Venture Capital Market Valuations and 449 Public Equity 
Market Valuations for an Unbalanced Panel of 204 U.S. Biotechnology Firms, 1992—2003 
Panel À: Distríbution of Valuation Dates 
1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 
Venture 15 29 46 60 71 70 67 60 52 5 0 0 
Public 0 0 0 0 8 49 71 71 40 78 74 58 


Panel B: Distribution of Valuations per Firm across the Venture Capital and the Public 





Equity Markets 
# Valuations per Firm in 
the Public Equity Market Welghted 
0 1 2 3 Sum Average 
# Valuations 0 0 0 ] 10 11 2.9 
Per Firn 1 6 I 3 25 35 2.3 
in the Venture 2 18 1 4 38 61 2.0 
Capital Market 3 16 4 6 39 65 2.0 
4 4 0 4 22 30 2.5 
5 0 oOo 9 2 2 3.0 
Sum 44 6 18 136 204 
Weighted Average 2.4 2.5 2.5 2.3 
Panel C: Number of Years between Firm Founding and IPO Eiling 
Years # Firms Years # Firms 

0 to 1 year 0 7 to 8 years 14 

] to 2 years 5 8 to 9 years 15 

2 to 3 years 12 9 to 10 years 6 

3 to 4 years 23 10 to 11 years 6 

4 to 5 years 43 11 to 12 years 6 

5 to 6 years 42 12 to 13 years 3 

6 to 7 years 24 > 13 years 5 


Panel D: Distribution of Firms’ Four-Digit SICs 


Four-Digit SIC —  — — SIC Industry Description — — f£ Firms — Firms 
2834 Pharmaceutical preparations 65 32 
8731 Commercial, physical, and biological research 50 24 
2836 Biological products (except diagnostic substances) 20 11 
3845 Electromedical and electrotherapeutic apparatus 12 6 
2835 In vitro and in vivo diagnostic substances 12 6 

All others Various (representing 17 four-digit SICs) 45 20 


(continued on next page) 
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TABLE 1 (continued) 


Panel E: Location of Firm's Headquarters at Time of IPO Filing 


State # Firms % Firms 
California 94 46 
Massachusetts 28 14 
Pennsylvania 11 5 
All others (20 states) 71 35 


The venture capital market sample is the set of U.S. biotech firms in Recombinant Capital's (Recap) pre-IPO 
valuation database that met three criteria: The firm filed to go public; its filing was online at http: / / 
www.sec.gov; and the valuation date was both less than 12 months after any of the firm's pre-IPO filing fiscal 
year-ends and was the first financing event after that fiscal year-end. The valuation date is the date the terms of 
the financing round were agreed on by the firm and the venture investors. The public equity market sample is 
the set of Recap firms that had an IPO filing document online and also completed their IPO. 


statements—would be usable. Financings that are not explicitly coded by Recap as being 
Series A, Series B, etc. are excluded from Table 2,142 as is the IPO filing or IPO offering. 
The reason for excluding the IPO itself is that a substantial body of research concludes 
that the equity values established at IPOs are associated with a number of anomalies (e.g., 
Shiller 1990; Ritter 1991; Ritter and Loughran 1995, 2002; Purnanandam and Swaminathan 
2004). There is also cause for concern as to whether equity values computed using the 
price range in the prospectus or the final offer price are arrived at in an arms-length manner, 
given the collusive behavior that has been attributed to underwriters (e.g., Ritter 2000). 

Table 2 shows that pre-money valuations increase as firms mature within the venture 
capital market, although at a decreasing rate. Valuations are much higher in the public 
market, partly because firms raise large amounts of cash at their IPO, and partly because 
going public may remove an illiquidity discount of 15 percent to 40 percent (Houlihan 
Valuation Advisors/ VentureOne 1998). 

A pattern of increasing median values is observed for almost all financial statement 
items except net income, which decreases steadily because biotech firms have small reve- 
nues but large expenditures on SG&A and R&D. Book-to-market ratios remain low during 
the time that firms are private, venture-backed entities. Of the 481 observations in the 
underlying pre-money valuation database, only 1 percent have a book-to-market ratio 
greater than 1. Consistent with firms' investment opportunity sets becoming smaller as firms 
mature, the median ratio of assets-in-place to pre-money equity value increases from 0.10 
at Series À rounds, to 0.16 at Series C rounds, to 0.21 at Series F rounds, to 0.40 at the 
end of the first year after an IPO, and to 0.46 at the end of the third year after an IPO. 
This supports a crucial assumption underlying the hypothesis that the value relevance of 
financial statements will rise as firms mature. 

Table 2 also reports the medians of several nonfinancial information items. For example, 
the age of the firm increases as the funding Series increases. The median gap between 


i4 Such financings include investments of common equity, private placements, and debt-related financing such as 
bridge and convertible notes. These can occur anywhere in the firm's pre-IPO life, whereas Series C financings 
never precede Series B financings, etc. 

5 One exception is that the median age of Series B financings is slightly less than that of Series A financings. 
Closer examination of this anomaly indicates that this was not due to data errors, but arose by chance because 
the procedure by which valuations and financial statement data are combined does not guarantee that age will 
be increasing in the Series level. It is also the case that sometimes a firm's first financing is labeled as Series 
B. 
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the date on which the pre-money valuation is established and the end of the preceding 
fiscal year is always four months or less and is smallest for Series A and largest for the 
IPO. This mitigates concerns that financial statement data are stale by the time the firm's 
current round of financing is undertaken and are staler the earlier the round. The median 
levels of the public market AMEX Biotech Index and the private market Sand Hill Biotech 
Index each rise steadily as firms mature. Equity dilution created at the current financing 
round declines from a high of 46 percent in Series A rounds to 17 percent in Series = F 
rounds, while the probability that the lead investor is a corporation steadily increases. The 
median number of patents filed and issued and the number of the firm's strategic alliances 
all increase monotonically as firms mature. 


V. REGRESSION METHODS 
Measuring Value Relevance 


In financial accounting research, financial statements are said to be value-relevant if 
they are associated with equity prices, values, or returns. Following prior studies, the value 
relevance of a set of financial statement data is measured using adjusted R?. Specifically, I 
define the value relevance of financial statements as the adjusted R? that can be uniquely 
attributed to financial statements in a regression of equity values on financial statements 
and nonfinancial statement information.! The total adjusted R? from a regression of equity 
values on financial statements and nonfinancial statement information can be decomposed 
into three mutually exclusive parts—that which is unique to financial statements, that which 
is unique to nonfinancial statement information, and that which is common to both financial 
statements and nonfinancial statement information. The adjusted R2 that is unique to finan- 
cial statements is the total regression adjusted R? less the adjusted R? from a regression of 
equity values on nonfinancial information alone. Likewise, the adjusted R? that is unique 
to nonfinancial statement information is the total regression adjusted R? less the adjusted 
R? from a regression of equity values on financial statements alone. Mindful of the infer- 
ential dangers of comparing adjusted R?s across samples (Brown et al. 1999; Gu 2002), I 
conduct auxiliary analyses to test whether such dangers are significant and conclude that 
they are not (see Section VI). ` 


Regression Models 


The main regression model that I use to assess the value relevance of financial state- 
ments and nonfinancial information in private and public equity markets is the log-linear 
model. This approach differs from the linear specifications applied in prior accounting 
research into value relevance, but it conforms to every analysis of venture capital valuations 
undertaken in the entrepreneurial finance literature (e.g., Lerner 1994a; Gompers and Lerner 
1999, 2000a, 2000b; Nicholson et al. 2003; Seppä 2003). The log-linear model has also 
been employed extensively in economics, particularly for evaluating R&D (Hall 1993, 2000; 
Hall et al. 2001). The advantages of the log-linear model include its flexibility in accom- 
modating nonlinear relationships (such as those arising from real options that are prevalent 
in biotech firms' equity values, as well as the interactions between biased accounting and 
large expenditures on intangible assets) and the econometric robustness it provides in deal- 
ing with outliers in the underlying nonlogged data. The main drawback of the log-linear 


16 Among others, Collins et al. (1997) and Barth et al. (1998) use incremental adjusted R? to measure value 
relevance. 

"7 Kaplan and Ruback (1995) and Berger et al. (1996) are two infrequent instances of the use of log-linear valuation 
models in the public equity market. 
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model is that it is difficult (although not impossible) to develop a model in which equity 
value is a log-linear function of financial statements or nonfinancial information from first 
principles.!* 

The regression models that I estimate include financial statement data from only the 
most recent fiscal year immediately prior to the valuation date. Including earlier financial 
statement data and/or instruments for expected future net cash flows, particularly as the 
firm matures and more years of historical financial statement data exists, could expand such 
a simplistic view. However, past financial statement data are highly correlated with current 
financial statement data, and direct measures of expected future net cash flows such as 
analyst forecasts are unavailable for pre-IPO firms.'? 

Ali dollar regression variables, Z, are log-transformed according to log.[Z + 1] where 
Z = 0 is in $000s. The following nondollar variables, Y, are log-transformed according to 
log.[Y + 1]: stock option dilution, the level of the AMEX Biotech Index, the level of the 
Sand Hill Biotech Index, new equity dilution at the financing date, the number of patents 
filed, the scope of patents filed, and the number of strategic alliances. Unreported statistics 
indicate that these log transformations dramatically reduce the skewness present in the raw 
data and achieve greater homoscedasticity in regression residuals relative to most models 
that are not log-linear (ranks of non-log-transformed data also achieve high homoscedas- 
ticity in regression residuals), ` 

The possibility that regression residuals may be serially correlated and/or heterosce- 
dastic because of calendar time clustering in equity valuations leads me to estimate the 
standard errors of regression parameters using Newey and West’s (1987) covariance matrix 
and Hansen’s (1982) GMM approach. This accommodates both serial dependence and con- 
ditional heteroscedasticity in the residuals and has a small loss of precision relative to OLS, 
should residuals in fact be homoscedastic and serially uncorrelated (Davidson and 
MacKinnon 1993). Inferences are unchanged if OLS standard errors are employed instead. 


VI. EMPIRICAL TESTS 
Regressions Where Different Financing Rounds Are Pooled within a Given Market 


I first estimate regressions in which observations are pooled within a given market. 
This provides a test of the first and second hypotheses of the paper—namely, that financial 
statements will overall be value-relevant in the venture capital market, and that the partial 
correlations between equity values and key financial statement items will be nonzero and 
of the same sign in both markets. The results, found in Table 3, broadly support these 
hypotheses. 

Table 3 shows that financial statements are similarly value-relevant in venture capital 
and public equity markets. Controlling for the effects of nonfinancial statement information, 
firms’ equity values are reliably positively related to cash, noncash assets, and R&D expense 
and are reliably negatively related to long-term debt and stock option dilution. Untabulated 
tests show that of the coefficients on the eight financial statement variables, only those on 


35 Ye and Finn (2000) motivate a log-linear model of firms’ equity market values, book equity, and net income by 
demonstrating that if the log of 1 plus the return on equity follows an AR(1) process, and net dividends are 
zero, then equity market value emerges as a multiplicative function of book equity and net income. Beatty et 
al. (2001) derive a log-linear valuation model under the assumptions that stock valuation is first-degree homo- 
genous in underlying valuation drivers, that accounting constructs measure such valuation drivers with multi- 
plicative measurement error that is conditionally lognormal, and that the unconditional distribution of stock 
values is either diffuse or lognormal. 

? Untabulated tests indicate that lagged financial statement data are rarely incrementally associated with current- 
period equity values. 
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cash and revenues reliably differ across markets. The strong significance of the coefficients 
on R&D and cash is consistent with the prominence that biotech venture capitalists accord 
to R&D and cash in their assessment of biotech firms' survivability and potential for suc- 
cess. The reliably negative coefficient on long-term debt dampens the potential criticism 
that the positive coefficients observed on cash, noncash assets, and R&D are illusory or 
overstated because such variables merely capture scale effects. Were that criticism valid, 





TABLE 3 


Log-Linear Regressions of U.S. Biotech Firms’ Pre-Money Valuations (PREMV) on Financial 
Statement Data (FS) and Nonfinancial Statement Information (NFS) 


1992-2003 


LnPREMV, = a + > 80,LnFS,, + >, yLnNFS.4 + Ex 
b c 


Independent Variables 
Financial Statement Data 


Ln (1 + long-term debt) 

Ln (1 4 revenue) 

Ln (1 + cost of sales) 

Ln (1 + SG&A) 

Ln (1 + R&D expense) 

Ln (1 + stock option dilution) 

Indicator: No stock option data 
Nonfinancial Statement Information 

Series À indicator 

Series B indicator 

Series C indicator 

Series D indicator 

Series E indicator 

Series = F indicator 

Two years after IPO indicator 

Three years after IPO indicator 

Corporate lead investor indicator 

Ln (AMEX public biotech index) 


Ln (Sand Hill venture biotech index) 


Ln (1 + firm age in years) 

Ln (1 + new equity dilution) 

Ln (1 + number of patents filed) 
Ln (1 + scope of patents filed) 


Ln (1 + number of strategic alliances) 


# observations 


# different firms 
OLS regression adj. R?” [total] 


Predicted 


Coefficient Sign 


+— I+ ++ 


+++ I! |! +++ 


Adj. R2 [unique to financial statement data] 
Adj. R? [unique to nonfinancial statement information] 


Adj. R? [common to both] 
Residual standard deviation 
Median absolute valuation error 
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Venture Capital Public Equity 
Market 


0.11 
0.18 
—0.05 
—0.01 
—0.01 
0.05 
0.15 
1.56 
—0.26 


—0.28 
"EAE 
—0.15 
—0.10 
—0.16 
—0.14 


0.21 
0.32 
0.17 
—0.01 
= 1.21 
0.07 
—0.21 
0.09 


481 


(44)*** 
(4.3)*** 
(—3.8)*** 


(2.9)*** 
(3.6)*** 
(1.8)* 

(—0.2) 


(—11.7)*** 


(2.7) *** 
(—3.0)*** 
(2.6)*** 


193 


7696 
1396 
1896 
4596 
0.56 


32% 


LH 


A A V VAV 


Market 


—0.07 
—0.15 


0.19 
—0.14 
0.06 


—0.39 
—0.05 


(6.9)*** 


(continued on next page) 
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* ** *** Denote coefficient estimates that are reliably significant at the 5 percent, 2.5 percent, and 1 percent 
levels, respectively. under a one-tailed test of the predicted sign on the coefficient. Where there is no sign 
prediction, significance levels are computed under a two-tailed test. 

The venture capital market sample is the set of U.S. biotech firms in Recombinant Capital's (Recap) pre-IPO 
valuation database that met three criteria: The firm filed to go public; its filing was online at http:// 
www.sec.gov; and the valuation date was both less than 12 months after any of the firm's pre-IPO filing fiscal 
year-ends and was the first financing event after that fiscal year-end. The valuation date is the date the terms of 
the financing round were agreed on by the firm and the venture investors. The public equity market sample is 
the set of Recap firms that had an IPO filing document online and also completed their IPO. 

All financial statement data are in $000s before log-transformation. 

The dependent variable is LnPREMV, , the log of firm Ps pre-money equity value at financing round k. GMM 
asymptotic Z-statistics are in parentheses. The intercept is estimated but not reported. Rounds other than those 
formally labeled by Recombinant Capital as Series A, B, C, D, E, and = F are reflected in the intercept. 

The AMEX public biotech index is an equal-dollar-weighted index designed to measure the performance of a 
cross-section of companies in the biotechnology industry that are primarily involved in the use of biological 
processes to develop products or provide services. It was established with a benchmark value of 200 on October 
18, 1991. 

The Sand Hill venture biotech index is a proprietary, periodic, timely, value-weighted, and unbiased index of the 
venture capital value of pre-IPO privately funded biotech companies developed by Sand Hill Econometrics of 
Palo Alto, CA. It had a benchmark value of 100 in January 1989. See http:/ / www.sandhillecon.com. 

New equity dilution is defined as the number of shares issued in the private equity financing round divided by 
the number of pre-money shares outstanding. The number of patents filed is the number of patents that had been 
filed as of the valuation date and that were issued by tbe U.S. Patent Office by 11/1/03, according to http:// 
www.uspto.gov. The scope of patents filed as of the valuation date is defined as the cumulative number of 
distinct four-digit International Patent Classification classes into which the firm's filed patents are assigned as of 
the valuation date, divided by the number of patents the firm had filed for as of the valuation date. 

The number of strategic alliances at the valuation date is the sum of the firm's upstream and downstream 
alliances in place at that time. These data are obtained from Recombinant Capital's alliances database. 

The median absolute valuation error is an economic measure of the regression goodness-of-fit. The valuation 
error is defined as the ratio of the actual equity value less the fitted equity value, divided by the fitted equity 
value. 


the strongly positive univariate correlations between equity values and long-term debt would 
lead to a positive, not negative, estimated regression coefficient on long-term debt. 

Two unexpected results emerge. The coefficient on revenue is not reliably positive in 
the venture capital market, and the coefficient on cost of sales is not reliably different from 
zero in either market. The former finding may reflect the transitory nature of young biotech 
firms' revenues, in that it takes many years for a biotech firm to develop a product and 
thereby generate recurring product revenue.” Consistent with this explanation, the coeff- 
cient on revenue in the public market—where recurring product revenue is more com- 
mon— is reliably positive. Also, although SG&A expense is not reliably related to firms’ 
equity values, this likely occurs because SG&A contains a mixing of period expenses that 
would be expected to be negatively related to equity value (such as rent on the firm's 
facilities), and costs that provide future benefits (such as salaries for senior management 
and scientific personnel) that would be expected to be positively related to equity value, 
particularly when the firm is young and intellectual capital is most vital. 

Table 3 also provides evidence on the value relevance of nonfinancial statement infor- 
mation in the venture capital market. The signs and significance of coefficient estimates on 


20 Few biotech companies generate substantial amounts of recurring product revenues until their drugs have passed 
through the FDA hurdles, which can be as much as 15 years into the life of a biotech firm (see Figure 1). 
Instead, young biotech firms generate revenues from fixed-period collaborations, contracts, grants, licenses, and 
research. 
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almost all the nonfinancial statement variables in Table 3 agree with what prior work found. 
Pre-money valuations are increasing in the level of publicly traded biotech company values 
and the number of a firm's patents and alliances and are also higher if the lead investor is 
a corporation rather than venture fund. One discrepancy is the reliably negative coefficient 
on the scope of the firm's patents, which Lerner (1994a) finds to be reliably positive. This 
result is not due to controlling for financial statement data; untabulated results show that 
the coefficient is negative if no financial statement data are included in the regression. I 
speculate that the economics of patent protection may have changed over time such that it 
is now more valuable to hold a portfolio of narrowly scoped patents rather than broadly 
scoped ones.?! 

The estimated coefficients on each of the two items of nonfinancial statement infor- 
mation that have not been previously examined in the entrepreneurial finance literature are 
reliably different from zero in the predicted directions. First, the coefficient on the SHB 
Index 1s reliably positive and not significantly different from the larger coefficient estimated 
on the AMEX Biotech Index. This is consistent with the proposition that the SHB Index 
is informative about the overall prospects of the biotechnology sector over and above that 
provided by the prices of publicly traded biotech companies. Second, the size and signifi- 
cance of the coefficient on equity dilution created by the current financing round strongly 
suggests that demand is far from being perfectly elastic in the venture capital arena. The 
latter conclusion is the same as that of Gompers and Lerner (2000b) and Seppä (2003). 
However, untabulated tests indicate that the proxy used in Gompers and Lerner (2000b) 
and Seppä (2003)—the total amount of funding committed to U.S. venture capital funds in 
the 12 months leading up to the valuation date—is insignificant when it is included in the 
regression alongside the equity dilution variable. 


Robustness Tests for Pooled Regressions 


Untabulated results indicate the findings reported in Table 3 are robust to several issues. 
First, some prior work has found higher pre-money valuations in western states, particularly 
California, most likely because of the value created by geographical concentrations of 
scientific capital, venture capital, or business networks (e.g., Gompers and Lerner 2000a; 
Seppä 2003).? This is not the case here. Other untabulated results indicate that coefficient 
estimates on indicators for California- and Massachusetts-headquartered firms and the major 
sub-industries in biotechnology are insignificant. Second, Gompers and Lerner (2000b) and 
Seppä (2003) find that pre-money valuations are higher when inflows of capital into venture 
funds over the 12 months leading up to the valuation date are higher. That is, firm valuations 
reflect less than perfectly elastic demand in that “too much money is chasing too few 
deals." The same result is found in this study, but only when the Sand Hill Biotech Index 
is replaced by the inflows of capital into venture funds over the prior 12 months. This is 
because the correlation between the level of the Sand Hill Biotech Index and inflows of 
capital into venture funds over the 12 months leading up to the valuation date is 0.95. If 
both variables are included in the regression, then the equity dilution created by the current 
financing round is found to be a superior proxy for the presence of less than perfectly 
elastic demand in that the estimated coefficient on equity dilution (asymptotic Z-statistic 
= —11.7, one-tailed p-value < 0.0001) far exceeds that on venture capital fund inflows 


?! Lerner’s and my samples do not cverlap in time, making this at least possible: Lerner’s data period is 1973— 
1992, whereas mine is 1992—2003. 

22 The firms examined in Gompers and Lerner (20008) and Seppä (2003) are not limited to biotech firms, but 
come from a broad cross-section of industries. 
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(asymptotic Z-statistic = —1.6, one-tailed p-value = 0.06). This is most likely because the 
equity dilution proxy is more firm-specific. 

It is also the case that if the total number of strategic alliances is separated into upstream 
and downstream deals, then the coefficient estimate on each deal-type is reliably negative, 
but there is no reliable difference between the coefficient estimates. Finally, the same in- 
ferences as those in Table 3 emerge if a rank regression is estimated instead of a log-linear 
model. The only exception is that the estimated coefficient on the total number of alliances 
is no longer reliably positive at the 1 percent level under a one-tailed test, only at the 10 
percent level. 


Regressions Estimated on a Financing-Round-by-Financing-Round Basis 

The third hypothesis of the paper is that financial statements will be value-irrelevant 
when a firm is founded but become increasingly value-relevant as the firm matures. The 
fourth is that nonfinancial statement information will become less value-relevant as a firm 
matures. The pooled regressions in Table 3 provide coarse confirmation of these hypotheses 
in that the results suggest that financial statements are, on average, less value-relevant in 
the venture capital market than they are in the public market, and vice versa for nonfinancial 
statement information. Specifically, the adjusted R? uniquely explained by financial state- 
ments is 13 percent in the venture capital market versus 26 percent in the public market, 
while the adjusted R? uniquely explained by nonfinancial information is 18 percent in the 
venture capital market and only 1 percent in the public market. 

Table 4 provides more powerful tests of the dynamic behavior of the value relevance 
of financial statements and nonfinancial statement information as firms mature. Moving 
from left to nght, Table 4 reports the results of round-by-round regressions that reflect 
increasing levels of firm maturity. Compared to pooled regressions, round-by-round re- 
gressions mitigate the potential impacts of sample selection bias because they control for 
one factor associated with the likelihood of filing for an JPO, namely the firm’s maturity 
(Cochrane 2004). Round-by-round regressions should also mitigate any time-varying effects 
of not explicitly controlling for risk. 

The regressions estimated in Table 4 contain only the most significant of the variables 
included in Table 3’s pooled regressions, and with cash and noncash assets combined into 
total assets. This constriction of the set of independent variables is applied because there 
are far fewer observations per round than in the pooled regressions, and because there are 
high correlations among some independent variables. Untabulated analysis indicates that 
the inferences drawn from the results in Table 4 are not sensitive to including a larger set 
of independent variables. 

The key results in Table 4 are consistent with the third hypothesis. Financial statements 
are value-irrelevant at (and only at) the Series A round, the first formal financing point for 


In-depth conversations with venture capital investors, particularly those who specialize in biotechnology, indi- 
cated that they deliberately stage their financing so that a firm's financing Series stage is more indicative of its 
scientific and economic maturity than is its calendar age. For example, biotech venture capitalists will typically 
only invest in the first formal financing round—the Series A—if certain scientific and business elements are 
present or will immediately be made present as a result of the funding. For example, they might require the 
firm to have a scientifically innovative idea for a new drug candidate, one or more highly regarded scientists, 
and at least one well-respected and seasoned biotech business executive. To obtain further financing through a 
Series B round, and then a Series C round, etc., the firm would need to achieve a number of specified scientific 
and/or business milestones, with significant flexibility given as to how long those milestones take to achieve. 
Evidence consistent with this venture capital perspective on firm maturity is that if firm age is added to the 
private equity market pooled regression estimated in Table 3, its coefficient is insignificant (t-statistic = —0.2). 
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venture capital investors. The p-value on the F-statistic testing the constraint that the co- 
efficients on both total assets and R&D expense are zero at Series À rounds is an insig- 
nificant 0.17. The second key result is that the value relevance of financial statements 
reliably increases as firms mature. Figure 2 plots the adjusted R? uniquely explained by 
each type of information shown in Table 4 as a function of the round-by-round maturity 
of the firm, together with OLS trendlines. The one-tailed p-value on the positive slope of 
the OLS trendline in the R? uniquely attributable to financial statements is 0.03. Very similar 
results obtain if ranks are used instead of log-transformed data. At least for biotech firms, 
financial statements are value-irrelevant at the earliest venture capital financing point, but 
become steadily more value-relevant as firms mature. 

Beyond this finding, several other results in Table 4 deserve mention. First, the median 
ratio of total assets to total firm value, a proxy for the relative size of assets-in-place versus 
future investment options (Smith and Watts 1992; Gaver and Gaver 1993; Adam and Goyal 
2003), reliably increases as firms mature (the two-tailed p-value on the positive trendline 
slope is less than 0.001). In addition, the correlation between the median ratio of total assets 
to total firm value and the R? uniquely attributable to financial statements is 0.51 (one- 
tailed p-value is 0.09). 

Second, consistent with the fourth hypothesis of the paper, the value relevance of 
nonfinancial information reliably decreases as firms mature. In Figure 2, the one-tailed p- 
value on the negative slope of the nonfinancial information unique R? trendline is 0.03. 
Virtually identical results obtain if ranks are used instead of log-transformed data. The 
time-series correlation between the value relevance of financial statement data and nonfi- 
nancial statement data is reliably negative (one-tailed p-value is 0.04), and that between 
the median ratio of total assets to total firm value and the value relevance of nonfinancial 
information is —0.82 (one-tailed p-value is 0.004). These indicate that in a dynamic sense, 
financial statements and nonfinancial statement data are informational substitutes, not com- 
plements, in valuation. 

Third, obtaining an accurate view of the dynamic behavior of the value relevance of 
nonfinancial statement information hinges on controlling for the value relevance of financial 
statement data. Specifically, round-by-round regressions that exclude financial statements 
yield an essentially flat adjusted R? series of 43 percent, 46 percent, 50 percent, 51 percent, 
44 percent, and 41 percent at Series A through Series = F rounds, respectively. This 
contrasts with the starkly declining series of adjusted R^s uniquely attributable to nonfi- 
nancial information of 31 percent, 26 percent, 17 percent, 16 percent, 9 percent, and 5 
percent, respectively (Table 4), where the effects of financial statements are controlled for. 

Fourth, as suggested by Table 3's results, Table 4 indicates that the lion's share of the 
value relevance of nonfinancial statement information in the venture capital market comes 
from the AMEX Biotech Index and the dilution created by the current financing round. On 
a round-by-round basis, the value relevance of patents and strategic alliances is very low. 
The same is true in public equity markets until the firm is three years or so beyond its IPO, 
at which point the AMEX Biotech Index and the number of patents each become value- 
relevant. The weak associations between equity values and the number of patents. filed and 
strategic alliances entered into may reflect the simplicity or coarseness of these proxies. 
However, total assets are also a coarse and simple proxy for the economic value of assets- 
in-place. 

Finally, the relations between value relevance and firm maturity are not artifacts due 
to scale or sampling variation. If this were so, then value relevance would be positively 
correlated with the coefficient of variation in candidate scale factors such as equity value 
or total assets (Brown et al. 1999, 2002). This is not the case; the Spearman correlations 
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between the adjusted R? uniquely attributable to financial statements and the coefficients 
of variation of log-transformed equity value and log-transformed total assets are —0.39 and 
—0.64, respectively. Alternatively, if the pattern in value relevance were due to sample 
variances that were systematically increasing over time, then value relevance would be 
positively correlated with the standard deviation of the regression residuals (Gu 2002). This, 
too, is not the case. The correlation between the adjusted R? uniquely attributable to finan- 
cial statements and the standard deviation of round-by-round regression residuals is —0.14, 
while the correlation between the adjusted R? uniquely attributable to nonfinancial statement 
information and the standard deviation of round-by-round regression residuals is —0.22. 


The IPO 


As noted in Section IV, this study has thus far deliberately excluded the IPO itself as 
a valuation point because a substantial body of literature has concluded that equity values 
set at and close to IPOs are associated with notable anomalies (e.g., Shiller 1990; Ritter 
1991; Ritter and Loughran 1995, 2002; Ritter 2000; Purnanandam and Swaminathan 2004). 
Common to many explanations for IPO-related pricing anomalies is the proposition that 
IPOs are sold by sophisticated underwriters to unsophisticated retail investors—i.e., that 
the marginal investor is markedly less than fully rational during the early months of a newly 
public firm's life. 

In the context of this study, the unsophisticated marginal investor proposition implies 
that the value relevance of financial statement data (nonfinancial statement information) at 
the IPO filing will be lower (higher) than that implied by the maturity-based trends reported 
in Table 4 and Figure 2. The rationale is that unsophisticated investors are likely to place 
too much weight on the value relevance of nonfinancial statement information, particularly 
herd-related variables such as the “hotness” of the IPO market, and therefore too little 
weight on the true value relevance of financial statement data. 

Table 5 reports the results of testing these predictions. Table 5 is structured in the same 
manner as Table 3, except that in between the columns of results for the venture capital 
and public equity markets are the results of estimating the same pre-money equity valuation 
regression at the IPO filing. The IPO filing results are consistent with the predictions made 
by the unsophisticated investor proposition. Specifically, at the IPO filing, the adjusted R?s 
uniquely attributable to financial statements and nonfinancial information are 2 percent and 
28 percent, respectively. These are almost exactly the reverse of what would be expected 
based on the trendlines in unique adjusted R?s shown in Figure 2. Among financial state- 
ment data, only noncash assets are reliably associated with firms' pre-money IPO equity 
values. In contrast, among nonfinancial items of information, the AMEX Biotech Index and 
number of patents filed are reliably positively related to pre-money IPO equity values, and 
the new equity dilution created at the IPO is reliably negatively related. Both the AMEX 
Biotech Index and the new equity dilution created at the IPO can be interpreted in an IPO 
context as measures of herd-related forces such as the “hotness” of the IPO market. It 
therefore appears that the value relevance of both financial statement data and nonfinancial 
statement information to some extent follows a “sawtooth” pattern when the IPO is in- 
cluded in the maturity timeline of U.S. biotech firms. 


Caveats and Limitations 

Beyond the straightforward concern as to how well the results reported here for venture- 
backed biotech firms generalize to the population of all pre-IPO firms, this paper has two 
main limitations: differences between the venture capital and public equity markets that 
have not been controlled for, and the uncertain impacts of selection bias. 
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As detailed in Section IL, the venture capital market differs from the public equity 
market along many dimensions, including regulation, liquidity, price-setting mechanisms, 
and degree of information asymmetry between managers and investors. Such differences 
may affect the value relevance of financial statements in ways that have not been taken into 
account in this paper. However, it is difficult to pinpoint in what ways and how strongly 


TABLE 5 
A Comparison of Log-Linear Value Relevance Regressions for U.S. Biotech Firms in the 
Venture Capital Market, at the IPO Filing, and in the Public Equity Market 


1992—2003 
LnPREMV, = a + > 0,LnFS,, + >, y,LnNFS, s + En 
Venture Capital At the Public Equity 
Independent Variables Market IPO Filing Market 
Financial Statement Data 
Ln (1 + cash) 0.11 (4.4)*** 0.06 (1.0) 0.531 (6.9)*** 
Ln (1 + noncash assets) 0.18 (4.3*** 0.16  (2.3)** 0.19  (3.3)*** 
Ln (1 + long-term debt) —0.05  (—3.8)*** —0.02 (-1.5) —0.03 (—2.3)*** 
Ln (1 + revenue) —0.01 (-12) —0.02 (-—1.5) 0.04  (1.8)* 
Ln (1 + cost of sales) —0.001 (—0.7) —0.01 (—0.8) 0.00 (0.1) 
Ln (1 + SG&A) 0.05 (0.9) —0.11 (-1.0) —0.14 (—1.3) 
Ln (1 + R&D expense) 0.15 (2.9)5*** 0.04 (0.6) 0.14 (1.5) 
Ln (1 + stock option dilution) —1.56 (—4.2)*** 0.10 (0.1) —2.50 (-33)*** 
Indicator: No stock option data —026 (—3.D*** 
Nonfinancial Statement Information 
Series A indicator —0.28  (—1.5) 
Series B indicator —021  (-2.1)* 
Series C indicator —0.5 (—1.7 
series D indicator —0.10 (-1.0) 
Series E indicator —0.16 (—1.6) 
Series = F indicator —0.14 (-1.1) 
Two years after IPO indicator —0.07 (—0.8) 
Three years after IPO indicator —0.15 (—1.2) 
Corporate lead investor indicator 0.21 (2.9) *** 
Ln (AMEX public biotech index) 0.32 (3.00*** 0.54 (2.7)*** 0.19 (1.6) 
Ln (Sand Hill venture biotech 0.17 (1.8)* 0.19 (0.7) 
index) 
Ln (1 + firm age in years) —0.01 (—0.2) 0.08 (1.0) —0.14 (-—1.4) 
Ln (1 + new equity dilution) —121 (-11.7)9*** —27] (—3.9)*** 
Ln (1 + number of patents filed) 0.07 (2.7)*** 0.13 (3.1)*** 0.06 (1.5) 
Ln (1 + scope of patents filed) —0.21 (—3.0)*** —0.08 (—0.5) —0.39 (—2.2)** 
Ln (1 + number of strategic 0.09 (2.6)*** 0.09 (1.5) —0.05 (-0.8) 
alliances) 
# Observations 481 103 449 
# Different firms 193 103 160 
OLS regression adj. R? [total] 7696 7296 4796 
Adj. R? [unique to FS data] 1396 296 2696 
Adj. R? [unique to NFS 1896 2896 196 
information] 
Adi. R? [common to both] 4596 4296 2096 
Residual standard deviation 0.56 0.44 0.87 
Median absolute valuation error 3296 .. 2696 49% 


(continued on next page) 
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*,**,*** Denote coefficient estimates that are reliably significant at the 5 percent, 2.5 percent and 1 percent 
levels, respectively, under a one-tailed test of the sign on the coefficient predicted in Table 3. Where there is no 
sign prediction, significance levels are computed under a two-tailed test. 

The venture capital market sample is the set of U.S. biotech firms in Recombinant Capital’s (Recap) pre-IPO 
valuation database that met three criteria: The firm filed to go public; its filing was online at http:// . 
www.sec.gov; and the valuation date was both less than 12 months after any of the firm's pre-IPO filing fiscal 
year-ends and was the first financing event after that fiscal year-end. The valuation date is the date the terms of 
the financing round were agreed on by the firm and the venture investors. The IPO filing sample is the set of 
Recap firms that had an IPO filing document online. The public equity market sample is the set of Recap firms 
that had an IPO filing document online and also completed their IPO. All financial statement data are in $000s 
before log-transformation. 

The dependent variable is LAPREMV,,, the log of firm Ps pre-money equity value at financing round k. GMM 
asymptotic Z-statistics are in parentheses. The intercept is estimated but not reported. Rounds other than those 
formally labeled by Recap as Series A, B, C, D, E, and =F are reflected in the intercept. 

The AMEX public biotech index is an equal-dollar-weighted index designed to measure the performance of a 
cross-section of companies in the biatechnology industry that are primarily involved in the use of biological 
processes to develop products or provide services. It was established with a benchmark value of 200 on October 
18, 1991. 

The Sand Hill venture biotech index is a proprietary, periodic, timely, value-weighted, and unbiased index of the 
venture capital value of pre-IPO privately funded biotech companies developed by Sand Hill Econometrics of 
Palo Alto, CA. It had a benchmark value of 100 in January 1989. See http:/ / www.sandhillecon.com. 

New equity dilution is defined as the number of shares issued in the private equity financing round divided by 
the number of pre-money shares outstanding. The number of patents filed is the number of patents that had been 
filed as of the valuation date and that were issued by the U.S. Patent Office by 11/1/03, according to http:// 
www.uspto.gov. The scope of patents filed as of the valuation date is defined as the cumulative number of 
distinct four-digit International Patent Classification classes into which the firm's filed patents are assigned as of 
the valuation date, divided by the number of patents the firm had filed for as of the valuation date. 

The number of strategic alliances at tbe valuation date is the sum of the firm's upstream and downstream 
alliances in place at that time. These data are obtained from Recap's alliances database. The median absolute 
valuation error is an economic measure of the regression goodness-of-fit. The valuation error is defined as the 
ratio of the actual equity value less the fitted equity value, divided by the fitted equity value. 


such differences will affect the average levels of the value relevance of financial statement 
` data and nonfinancial statement information in the two markets or the dynamic behavior of 
value relevance as firms mature. | 

With regard to biases that may arise from sample selection criteria, restricting the 
sample to firms that were successful enough to file for an IPO yields benefits in, the form 
of rich financial statement data, but also carries risks. Specifically, the coefficients on the 
determinants of pre-money valuations may be biased if financial statements or nonfinancial 
information are more value-relevant for successful firms: Any bias may also be more severe 
the earlier the financing round to which the pre-money valuation applies. 

It may be that any inferential biases are small. First, the firms used in this study consist 
of a truncated sample in that neither valuations nor valuation determinants are observed for 
firms that do not file to go public (because they are bought out, terminate, or continue in 
existence without new private equity funding or an IPO filing). The standard result of 
truncating on variable y, when the true relation between y and x is linear, is to bias toward 
zero the slope coefficient in a regression of observed y on observed x. This suggests that 
the slope coefficients on the determinants of valuations may be biased toward, not away 
from, zero. Second, the pooled regressions include financing round indicators in pre-money 
valuation regressions. The presence of these indicators should mitigate bias on slope co- 
efficients that would otherwise arise from more mature rounds having larger mean values 
of the pre-money valuation dependent variable (Maddala 1977). Finally, in their study of 
the impact of fund inflows on venture-backed firms' pre-money valuations, Gompers and 
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Lerner (2000b) find that using Heckman's (1976) two-stage method to correct for selection 
bias has little impact on the magnitude or the significance of their independent variables.” 


VIL CONCLUSIONS 

This study contributes to the value relevance literatures in accounting and entrepreneu- 
rial finance by documenting the level, nature, and dynamics of the value relevance of 
financial statements in the venture capital subsector of the private equity market, and over 
the previously unexamined early stages of the firm life-cycle. Using a sample of U.S. 
biotechnology firms during 1992—2003, I found that financial statements are, on average, 
value-relevant in the venture capital market, despite the fact that such markets are different 
from the public market in numerous economically significant ways. Financial statements 
are also similarly associated with equity values in economically sensible ways in each 
market equity values are positively related to firms' cash balances, noncash assets, and 
R&D expense, and are negatively related to their long-term debt and stock option dilution. 
This indicates that, as intended by the FASB in setting up the Conceptual Statements and 
Issued Standards, GAAP successfully provides information that is useful to investors outside 
of the public equity market. This is true even for firms such as young biotech companies 
whose equity values are dominated by expectations of profitable future investments rather 
than assets-in-place. 

I also showed that financial statement data are less value-relevant in the venture capital 
market than is nonfinancial statement information, and that the opposite holds true in the 
public equity market. Specifically, financial statements and nonfinancial statement infor- 
mation for biotech companies are in a dynamic sense information substitutes, not comple- 
ments. As firms mature, the value relevance of their financial statements reliably increases, 
while the value relevance of nonfinancial statement data reliably decreases. I provided 
evidence that this divergence occurs because as firms mature, the importance of assets-in- 
place increases relative to that of investment opportunities, and financial statement data are 
better than nonfinancial information at reflecting the value stemming from assets-in-place. 
The only exception to these patterns occurs at the IPO, where the value relevance of fi- 
nancial statement data and nonfinancial statement information reverse in importance. This 
"sawtooth" in the maturity trends in value relevance appears to occur because the marginal 
investor in an IPO is markedly less sophisticated than either venture capital investors in the 
pre-IPO venture capital market, or investors in seasoned post-IPO firms. 

Finally, beyond documenting the strong relations between financial statement data and 
equity value in the venture capital market, this paper extends the literature in entrepreneurial 
finance by combining into one analysis most of the nonfinancial statement information 
variables examined in prior work, and also by identifying two new value-relevant nonfi- 
nancial factors. In particular, the equity dilution created by the current round is shown to 
be a very significant factor in explaining variation in firms' pre-money valuations, indicating 
that the elasticity of demand in the venture capital market is markedly less than infinite. 
Given this, an appealing topic for future research to explore might be to identify the de- 
terminants of the elasticity of demand, and whether and why financial statements are in- 
cluded in the set of those determinants. It would also be fruitful to determine whether the 
results found for biotech firms generalize to other technology firms, as well as non- 
technology companies, which sometimes obtain venture capital funding. 


24 Gompers and Lerner (2000b) first estimate the probability that VentureOne, their data provider, was able to 
obtain information about the valuation in the financing round, and then seek to explain the determinants of the 
valuation after adjusting for that probability. 
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ABSTRACT: | examine the association between audit committee quality and the quallty 
of corporate Internal control. While information on the quality of internal control is not 
generally available, companies changing auditors are required to disclose any Internal 
control problems that were pointed out by their predecessor auditors. The empirical 
results are based on a comparison of companies disclosing such internal control prob- 
lems with a control sample of companies changing auditors but not disclosing internal 
control problems. Audit committee quality Is measured in three dimensions: its size, its 
independence, and its expertise. The internal contro! problems are observed at two 
levels of increasing seriousness: reportable conditions and material weaknesses. The 
sample time period precedes the effective dates of recent policy changes regarding 
audit committees. The results Indicate that independent audit committees and audit 
committees with financial expertise are significantly less likely to be associated with 
the Incidence of internal contro! problems. This is true for both levels of Internal control 
problems. The results are consistent with recent policy emphasis on audit committee 
independence and expertise. 


Keywords: audit committee; internal control; reportable condition; material weakness. 
Data Availability: Data used in this study are from public sources. 


I. INTRODUCTION 
he Sarbanes-Oxley Act of 2002 and the Securities and Exchange Commission (SEC) 
rules introduced to carry out the directions of the Act indicate that improving cor- 
porate internal control is an important policy initiative (U.S. House of Representa- 
tives 2002). The SEC now requires CEOs and CFOs to certify annually and quarterly about 
the effectiveness of their corporate internal control (SEC 2002). Further, the SEC has an- 
nounced rules for the implementation of Section 404 of the Sarbanes-Oxley Act, requiring 
every registrant to include an internal control report in its annual report, and its independent 
auditor to issue an “attestation report on management's assessment of the company's in- 
ternal control over financial reporting" (SEC 2003a, 1). 
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The emphasis on good internal contro] of course arises because it is considered to be 
an important factor in achieving good quality financial reporting. Among the numerous 
criticisms leveled at Enron Corporation regarding its financial reporting was the charge of 
a failure in its internal control (Verschoor 2002). It is therefore important to examine mon- 
itoring mechanisms corporations can use to ensure the effectiveness of their internal control. 
Establishing and maintaining proper internal control is the responsibility of management. 
One monitor of an entity's internal control is its audit committee.! Although not mandated 
by SEC rules, the accounting profession and policy makers have maintained that a primary 
function of the audit committee should be the oversight of internal control? Anecdotal 
evidence (e.g., disclosures in proxy filings) and academic research suggest audit committees 
do view monitoring internal controls as one of their functions. Carcello et al. (2002, 294) 
report that 91 percent of a random sample of corporate audit committee charters list reviews 
of internal controls as one of the duties of the audit committee. Recently, the SEC (2003b) 
has mandated that all material written communications between a company's accountant 
and management be provided to the entity's audit committee, and this includes "reports on 
observations and recommendations on internal controls." The implication is that an entity's 
internal controls should be under the purview of its audit committee. 

Because the audit committee is expected to monitor the entity’s internal control, I posit 
a positive association between audit committee quality and internal control quality. What 
constitutes a good audit committee however is not clear. In 1999, the major exchanges 
introduced new requirements for the size, independence, and expertise of corporate audit 
committees with a view to improving financial reporting. In its endorsement of the ex- 
changes' proposals to revise audit committee requirements, the SEC quoted the Blue Ribbon 
Committee on Improving the Effectiveness of Corporate Audit Committees (BRC) to argue 
that, under the new requirements, audit committee directors would be “more likely to 
objectively evaluate the propriety of management's accounting, internal control, and finan- 
cial reporting practices" (SEC 1999) (emphasis added). However, given the skepticism 
about the effectiveness of audit committee independence following recent disclosures about 
the composition of Enron's audit committee (which was independent as defined by the 
SEC), and concerns about the adequacy of the audit committee expertise requirement 
(Whitehead 2002), it is an open question whether the audit committee independence and 
expertise requirements can favorably affect the monitoring of internal control. 

I examine the association between audit committee composition and the quality of 
corporate internal control, using data from a time period preceding the effective date (June 
2001) for the SEC (1999) rules. Data on internal control quality is not generally available, 
which explains the absence of studies in this area. However, companies that change auditors 


a^ 


More completely, monitoring internal control is the result of actions by, and interaction between, management, 
the internal auditor (if present), the external auditor, and the audit committec. I focus on whether, holding other 
things constant, the audit committee has an impact on internal control quality. 

An early release by the SEC stressed the "importance of andit committees to enable board of directors to better 
fulfill their oversight responsibilities with respect to an issuer's accounting, financial reporting and control 
obligations" (SEC 1979, 427). The Treadway Commission's report recommended: “To be effective, audit com- 
mittees should exercise vigilant and informed oversight of the financial reporting process, including the com- 
pany's internal controls" (NCFFR 1987, 12). 
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are required to disclose any internal control problems that were pointed out by their pred- 
ecessor auditors." Using Form 8-K disclosures about internal control problems for a sample 
of companies that changed auditors, and a control sample of companies that changed au- 
ditors but did not report internal control problems, I find a negative association between 
the proportion of independent members on the audit committee and the existence of internal 
control problems. This result supports recent studies that have shown an association between 
audit committee independence and two aspects of the quality of financial reporting, the 
auditor's issuance of the going concern opinion for financially stressed companies (Carcello 
and Neal 2000), and the extent of earnings management as measured by discretionary 
accruals (Klein 2002b). I also find a negative association between the number of audit 
committee members with financial expertise and the existence of internal control problems. 
This finding suggests, in line with prior experimental studies regarding the association 
between audit committee expertise and financial reporting (e.g., McDaniel et al. 2002; 
DeZoort 1998), that audit committee financial expertise is associated with better controls. 

The next section describes the hypothesis and research method. In Section MI, I describe 
the data. This is followed in Sections IV and V by the empirical findings and additional 
tests. The last section contains a discussion of the results and my conclusions. 


IIl. HYPOTHESIS AND RESEARCH METHOD 

The quality of an entity's internal control is a function of the quality of its control 
environment (which includes the board. of directors and the audit committee) and its other 
non-governance-related controls. The profession and policy makers have noted repeatedly 
that one of the recommended duties of the audit committee is to review the entity's internal 
control. For example, the BRC (1999, 38) recommends that “the audit committee should 
encourage procedures that promote accountability ... ensuring that management properly 
develops and adheres to a sound system of internal control."^ Other policy-making bodies 
have, time and again, made similar recommendations. 

. While it seems clear that audit committees are expected to play a role in maintaining 
effective control, what happens in reality? There is some evidence that audit committees 
participate in the maintenance of control. Based on a survey of audit committee members, 
DeZoort (1997, 210) reports that “audit committee members perceive internal control eval- 
uation as the most important audit committee oversight area." A study of a random sample 
of audit committee charters (Carcello et al. 2002) reveals that a majority of audit committees 
claim review of controls to be one of their duties. Examination of disclosures in proxy 
filings of companies in my sample also shows that in practice, corporate audit committees 
do engage in the examination of controls. 

If audit committees are expected to oversee, and view themselves as overseers of, 
entities” internal control, then it seems plausible that a high-quality audit committee would 
be associated with a good-quality internal control. Other studies have found that the quality 
of the audit committee is associated with financial reporting outcomes (Carcello and Neal 
2000; Klein 2002b). Thus I posit the following: 


> I acknowledge that this sample is not representative of the population of firms, because auditor change firms 
have been shown in past work to be different from the general population of firms. (See Krishnan and Krishnan 
(1997] and Raghunandan and Rama [1999] for discussions of this literature.) However, as I note later, smaller 
firms are frequently associated with restatements and fraud, both of which could be associated with internal 
control problems. | 

^ Ernst & Young (1992) identifies the review of internal control as one of the responsibilities of the audit com- 
mittee, and suggests the committee should engage the independent auditors, the internal auditors, and financial 
and accounting personnel in these reviews. Similar recommendations under “best practices" for audit committees 
are made by other Big 5 firms (e.g., PricewaterhouseCoopers 2000; KPMG 1999). 
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H1: Other things equal, the quality of the audit committee is positively associated with 
the quality of internal controls. 


The mechanisms through which a high-quality audit committee would ensure a high- 
quality internal control could differ across entities. It could directly engage in overseeing 
the entity's controls by reviewing "internal accounting procedures and controls with the 
company's financial and accounting staff," as recommended by a special committee of the 
AICPA in 1979 (Rittenberg and Nair 1993, 65). More likely, it could engage in discussions 
with the internal and external auditors about their assessment of internal control. The ex- 
ternal auditor is expected (per SAS No. 61, AICPA 1988c) to communicate with the audit 
committee if it identifies concerns about the quality of the entity's internal controls. A 
better-quality audit committee is likely to follow up on these communications by eliciting 
recommendations for improvement and ensuring management carries out these recommen- 
dations. Indirect evidence that better audit committees might demand more audit effort or 
greater audit scope and coverage is provided by Abbott et al. (2003), who report a positive 
association between audit committee effectiveness and audit fees. 


Internal Control Measure 


To test the hypothesis, I estimate a logistic regression model with a measure of internal 
control quality as the dependent variable, and audit committee measures and control vari- 
ables as independent variables. The internal control quality measure, /CPROB, coded 1 or 
O according to the presence or absence of internal control problems, is based on auditor 
communications that were made privately to the client during the audit, but subsequently 
made public on the Form 8-K when a change of auditor occurred. Standards for auditor 
judgment about the entity's internal control are contained in SAS No. 55 (AICPA 19882), 
subsequently amended by SAS No. 78 (AICPA 1995). Standards for communicating the 
findings to the client are set out in SAS No. 60 (AICPA 1988b). SAS No. 60 requires that 
if, during the course of an audit, the auditor becomes aware of deficiencies in an entity's 
internal control, termed “reportable conditions," it should communicate such matters— 
through written or oral reports—to the audit committee or to someone of equivalent au- 
thority. These deficiencies may be discovered at any point during the audit and must be 
communicated regardless of the control risk assessment and the decision to rely or not rely 
on the client's controls. 

These communications are not generally publicly available. However, companies that 
change auditors are required to state whether any "reportable events" had occurred over 
the previous two years, and if so, disclose the nature of the reportable events. Reportable 
events include any internal control issues raised by the predecessor auditor (SEC 1988).° 

In addition to reporting the existence of an internal control problem, the 8-K disclosures 
also reveal its severity, which, based on SAS No. 60, can take two levels. The first level 
of problems are “reportable conditions," defined as "significant deficiencies in the design 
or operation of internal control, which could adversely affect the organization's ability to 
initiate, record, process, and report financial data consistent with the assertions of manage- 
ment in the financial statements" (SAS No. 60, AICPA 1988b, AU 325). When these 


5 Reportable events are not the same as reportable conditions. The latter refers specifically to internal control, and 
the former includes the latter. Financial Reporting Release No. 31 defines reportable events as situations ‘where 
the accountant has advised the registrant that it questions the accuracy or reliability of the registrant's financial 
statements, management’s representations, the registrant's internal controls or prior audits" (SEC 1988, 1141). 
Although the other reportable events may be correlated with the occurrence of internal control problems, they 
are distinct from the latter. 
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reportable conditions are very severe, they are designated “material weaknesses.” A ma- 
terial weakness is “a reportable condition in which the design or operation of one or more 
of the internal control components does not reduce to a relatively low level the risk that 
misstatements caused by error or fraud in amounts that would be material in relation to the 
financial statements being audited may occur and not be detected within a timely period 
by employees in the normal course of performing their assigned functions" (SAS No. 60, 
AICPA 1988b, AU 325). Thus, a material weakness is a reportable condition, but a re- 
portable condition may not be a material weakness. 


Audit Committee Quality 

Definitions of an effective audit committee vary (DeZoort et al. 2002). One expects 
both audit committee composition and audit committee activity to be relevant. In general, 
. previous empirical research (Carcello and Neal 2000; Klein 2002b), has focused on com- 
position, possibly because composition is visible while activity is not. In 1999, the major 
stock exchanges introduced (with endorsement by the SEC) new requirements for audit 
committee composition and structure (e.g., SEC 1999). All listed companies must appoint 
audit committees with the following characteristics: (1) at least three members, (2) all 
members (except in restricted circumstances) independent of management, and (3) at least 
one member with financial expertise. 

Consequently, I use three measures, size (ACSIZE), independence (ACINDEP), and 
financial expertise (ACEXP) to proxy for audit committee quality. ACSIZE is a dummy 
variable coded 1 if the number of members is three or more, and 0 otherwise. ACINDEP 
is the proportion of members that are independent directors. ACEXP is the number of audit 
committee members with financial expertise. The source for these variables, and their con- 
struction, are discussed in Section III. 


Other Monitors 


Other monitors——management, the external and internal auditors, and the board— 
interact with the audit committee to determine the entity's internal control. Because the 
other monitors can also have independent effects on internal control, and one cannot observe 
the various interactions between these monitors and the audit committee, I include controls 
representing the other monitors. 

Three variables proxy for management quality. First, the extent of formal training of 
top management in the accounting and finance functions would influence the likelihood of 
emergence of internal control problems. ACCTEXP indicates the presence of personnel, 
Chief Financial Officer (CFO) or Controller or Chief Accounting Officer (CAO), with a 
CPA certification or previous experience in a CPA firm. Second, top managers may lack 
formal training but possess specific experience as overseers of the accounting or finance 
functions. CFOEXP indicates whether the CFO (or Controller or CAO) has had prior ex- 
perience in one or more of these positions in other employment. I expect both of these 
variables to be negatively associated with the presence of internal control problems. Third, 
management integrity is a major determinant of internal control quality. While management 
attitudes are not observable, I use a proxy, FRAUD, based on observed data to capture 
management integrity. FRAUD indicates whether the firm faced SEC action relating to 
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fraudulent financial reporting, and/or restated its financial statements due to GAAP viola- 
tions, at the time of the 8-K filing or later. 

Three auditor characteristics are included, auditor industry share (ASHARE), auditor 
tenure (TENURE), and auditor type (BIG5). External auditor control variables are needed 
because, as noted, auditors may have an independent effect on the quality of control due 
to their charge to assess it. These variables also serve to control for differences in auditors' 
discovery and reporting of the problem. 

Auditor industry specialization (ASHARE, measured as the auditor's share in total assets 
in the firm's two-digit industry) can proxy both for the ability to detect problems due to 
knowledge of industry-specific issues (Craswell et al. 1995), and its independence. 

Prior literature suggests auditor tenure can affect audit quality, auditor independence, 
and auditor reporting. As auditor tenure increases, client-specific knowledge is enhanced 
possibly improving audit quality. Myers, Myers, and Omer (2003), Frankel et al. (2002), 
and Johnson et al. (2002) report negative associations between auditor tenure and measures 
of discretionary accruals, and Geiger and Raghunandan (2002) report a negative association 
between auditor tenure and the likelihood of failures in auditor reporting. Therefore, I 
include auditor tenure (TENURE) as a control variable, and predict a negative sign for it. 

The Big 5 auditors are widely characterized as “brand-name” and “high-quality” pro- 
viders, and as the providers of “differentiated” audits (e.g., Craswell et al. 1995; Francis 
et al. 1999; Krishnan 2003). A priori, this suggests they may do better than non-Big 5 
auditors in detecting and reporting internal control issues. However, evidence on Big 5/ 
non-Big 5 differences in error detection does not indicate significant differences (DeFond 
and Jiambalvo 1991; Petroni and Beasley 1996). In addition, clientele differences between 
the two auditor types—due to differential screening practices by auditors—can result in 
differences in clients’ inherent propensity to generate internal control problems.’ Thus, I 
include a dummy variable (BIG5) to control for auditor type, but do not predict a sign 
for it. 

A dummy variable for the presence of an internal audit function (INTAUD) is included. 
Because the board can also serve as monitor, and audit committee characteristics can reflect, 
or be determined by, board composition (Beasley and Salterio 2001; Klein 20022), I include 
a board variable, BDINDEP, measured as the proportion of nonaudit committee directors 
that are independent as an additional control (Carcello and Neal 2003). I predict negative 
signs for both INTAUD and BDINDEP. 


Other Control Variables 


Other variables control for a variety of factors that might impact internal control quality. 
It is generally expected that larger firms have better internal controls (DeFond and 
Jiambalvo 1991). However, evidence on the association between firm size and accounting 
errors (which is expected to be negatively correlated with the quality of control) is mixed, 
as Kinney and McDaniel (1989) note, with some studies reporting a negative association 
(e.g., Wright and Wright 1996) and others finding no significant difference (e.g., DeFond 


$ Other factors, identified in SAS No. 55 (AICPA 19888), such as ethical values of management or human resource 
policies and practices, are difficult to measure using archival data. For example, Cohen and Hanno (2000), 
examine in an experimental setting the effect of corporate governance and management control philosophy on 
auditors’ planning judgments. Other studies examine the impact, in experimental settings, of internal control on 
audit planning judgments and financial statement errors. See Bell and Wright (1995) for a review of this 
literature. 

? Other factors affecting Big 5/non-Big 5 audit quality differences are differences in the provision of internal 
audit and nonaudit services in conjunction with audit services. Due to leck of data, I do not investigate the 
effect of these factors. 
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and Jiambalvo 1991). Studies relating to financial fraud however have generally found a 
negative association between firm size and the incidence of fraud (e.g., Beas]ey et al. 2000). 
Size is measured by the natural logarithm of total assets (LASSET). I do not predict a sign 
for LASSET. 

Firms that grow too rapidly may outgrow their control systems (Kinney and McDaniel 
1989; Stice 1991). However, slow growth can also indicate bad performance and a conse- 
quent inability to invest in proper controls. Kinney and McDaniel (1989) find, for example, 
that the incidence of error correction (which is expected to be positively associated with 
weak internal control) is higher for low-growth companies, possibly indicating bad controls 
for low-performing (slow-growth) companies rather than high-performing (high-growth) 
companies. To capture extreme growth levels, high growth (HIGHGR) is measured by a 
dummy variable, coded 1 for growth rate in assets (in the year before auditor change) 
greater than the industry (two-digit SIC) median, and 0 otherwise. I do not predict a sign 
for HIGHGR. The number of years that a company has been publicly traded has been 
shown to be associated with fraud and restatements (Beneish 1999; Myers, Myers, 
Palmrose, and Scholz 2003), which in turn are likely to be associated with weak internal 
control. I include IPO, a variable that indicates whether the company went public within 
the last five years before the year in which it appears in my sample. Studies also indicate 
that fraud is most often related to revenue recognition (e.g., Beasley et al. 2000) and 
restatements are most often related to revenue and receivables (Palmrose and Scholz 2004). 
Consequently, I control for the ratio of receivables to assets (RECASST). 

Financial distress may prevent an entity from investing in the maintenance of proper 
internal control. Distress is measured by two variables: the probability of bankruptcy using 
the Zmijewski (1984) model (PRBANK) and a dummy variable (LOSS) indicating a loss 
during the current year. I include a control for the time period 1999—2000 (YEAR), because 
a number of regulatory changes occurred during this period, some of which could affect 
audit committee composition. I also include two dummy variables (NASD and NYAM) 
indicating the National Association of Securities Dealers Automated Quotation System 
(NASDAQ)-National Market System and New York/ American stock exchanges. 

Finally, I include a control, RESIGN coded 1 for an auditor resignation, and 0 otherwise. 
The SEC requires disclosure about the nature of auditor change because it believes these 
disclosures could reveal troublesome audit situations. Previous work (Krishnan and 
Krishnan 1997; Raghunandan and Rama 1999) has shown that companies whose auditors 
resign are different from companies whose auditors are dismissed. The inclusion of RESIGN 
controls for these differences. 


Choice of Control Sample 


The test sample consists of companies that switched auditors and reported internal 
control problems on their Forms 8-K. The control sample is a sample of auditor switchers 
that did not report internal control problems on their Forms 8-K.* However, one must 
consider the likelihood that such a sample may in fact include situations where the company 
has (unreported) internal control problems. The control sample may not be free of internal 
control problems if (1) the predecessor auditor failed to identify or report internal control 
problems during its tenure with the client and/or (2) the client failed to report on the Form 
8-K that the auditor had, by written or oral report, notified the client of the problem. 


* An alternative control sample is a random sample of companies that did not change auditors. However, since 
the internal communications about internal control are not publicly available, one cannot ascertain the absence 
of internal control problems for such a sample. 
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Failure by the auditor to identify an existing internal control problem relating to finan- 
cial reporting would likely reflect the quality of the audit. Likewise, failure to report a 
problem when identified would depend on auditor quality and independence, and on the 
auditor's concern with potential litigation and loss of reputation. Controls for auditor dif- 
ferences are included to address this possibility. 

As for (2) above, the client is unlikely to avoid reporting an internal control problem 
on its Form 8-K because the reporting process for auditor changes requires the auditor to 
endorse the contents of the Form 8-K filed by the client. Further, the reporting requirements 
state that, in the absence of reportable events, the registrant must specifically state there 
were no reportable events. This puts pressure on the registrant to make the correct disclo- 
sures. Krishnan (2002) finds that situations where the client makes inadequate or inaccurate 
disclosures (as revealed later by the auditor) on the Form 8-K are relatively infrequent, and 
that such situations are met with negative stock market reactions. Because clients are likely 
to be concerned about such negative consequences, and auditors are likely to be concerned 
about litigation and reputation loss, situations where the problem is not revealed by the 
client or subsequently by the auditor will be relatively few. 

In sum, while one cannot rule out the possibility that the control sample is not free of 
internal control problems, I expect that such occurrences would be relatively few. Also, 
any resulting bias would work against finding differences between the test and control 


samples.? 


HI. DATA 

I identified (from the Disclosure Inc. database) an initial sample of companies that 
changed auditors over the period 1994—2000 and whose Forms 8-K were available on the 
SEC’s EDGAR database.'? I searched the Forms 8-K for disclosures of internal control 
problems. This yielded 204 observations. I dropped 40 companies that did not have audit 
committees. À total of 36 observations were lost because of the unavailability of filings 
and/or incomplete audit committee or board data (29 observations), or because of incom- 
plete data for one or more of the other variables (seven observations). This yielded an 
"internal control" (INTCONT) sample of 128 companies. 

The financial variables (obtained from the Disclosure database) pertain to the last fiscal 
year of the predecessor auditor. I constructed the audit committee variables using infor- 
mation in proxy statements, and two of the management quality variables using information 
in proxy statements and 10-K filings.'! Following prior work, I define a nonindependent or 
affiliated director as "current or former officers or employees of the company or of a related 
entity, relatives of management, professional advisors to the company, (e.g., consultants, 


? If the likelihood that the disclosure is contained in the 8-K is systematically associated with audit committee 
characteristics (for example, because a good audit committee would ensure that the disclosure is made), then 
audit committee quality could actually be positively (rather than negatively as hypothesized in this paper) as- 
sociated with the disclosure of internal control problems. As discussed, it seems likely that the failure to disclose 
the problem would be related more to auditor quality. Thus, controls for auditor differences control for such 
potential disclosure differences. 

19 T used the June and December editions for the years 1996-2000 and the June 2001 edition of the Disclosure 
database. To identify potential auditor changes, I used two techniques: & keyword search, using terms like 

"resign," "accountant," “auditor change," “dismiss”; and examination of items listed under 8-K item 4 filings 
on the Disclosure database. 

ti To ensure that the measurement period of the audit committee variables coincided with the period of auditor's 
assessment of internal control, I examined proxy statements dated cIose to the auditor change date. On average, 
the proxy filing dates were 46 (39) days after the date of auditor change for the INTCONT (NOPROB) sample. 
The median difference between the proxy date and the auditor change date is 53 days for both samples. 
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bank officers, legal counsel), officers of significant suppliers or customers of the company, 
and interlocking directors" (Carcello and Neal 2000, 453—454). Using information in the 
proxies about the employment history of each director, family relationships, and other 
significant business relationships, I classify audit committee members as independent di- 
rectors if they are not affiliated (Beasley 1996; Klein 2002b). As in Beasley et al. (1999), 
I classified each audit committee member as a financial/accounting expert if s/he was a 
Certified Public Accountant, Chartered Financial Analyst, or had work experience in a 
financial or accounting position. Using information about past experience of officers, I 
coded ACCTEXP as 1 if the CFO or CAO or Controller had public accounting experience, 
and 0 otherwise. If the CFO or CAO or Controller had prior experience in a similar capacity, 
then I coded CFOEXP as 1, and 0 otherwise." 

To construct FRAUD, I searched the SEC website and newswires and newspaper articles 
using the Lexis/Nexis database. From the SEC website, I obtained information about any 
action taken by the SEC against the company or its current or former management involving 
fraud or restatements due to GAAP violations starting from the period of filing of 8-K 
through Januarv 2004. I searched major newspapers and newswires using keywords such 
as “restate,” “restatement,” and "fraud" to identify companies that engaged in fraud or 
restated their financials due to GAAP violations. 

The internal audit variable was constructed, following Beasley et al. (2000), by ex- 
amining whether the proxies mentioned the presence of an internal audit function.'? Auditor 
market share (based on client assets) was constructed for each sample year and for each 
auditor-two digit SIC industry combination using data from the Disclosure database, using 
all observations with valid data. I computed the number of years the auditor had been with 
the auditor (TENURE), using Compustat and the Disclosure database for most firms, and 
proxy filings for a few.!^ 

A control sample of companies without internal control problems (NOPROB) was gen- 
erated from the population of other auditor changes. For each company in the INTCONT 
sample, a random choice for the control sample was made from among the companies that 
changed auditors using three criteria: (1) Each INTCONT company was matched with a 
NOPROB company by industry. Matching by industry is desirable because prior research 
(Beasley et al. 2000) finds that the likelihood of fraud is higher in certain industries, and 
there is a direct relation between weak internal control and the incidence of fraud (Bell 
and Carcello 2000). Industry matching was based on three-digit SIC codes. Where it was 
not possible to match on three digits, two digits (29 observations) and one digit (14 obser- 
vations) were used; (2) Each INTCONT company was matched with a NOPROB company 
on the exchange on which the company was listed. Matching on exchange is necessary 
because listing requirements relating to the board of directors are different for the New 
York Stock Exchange (NYSE), American Stock Exchange (AMEX), and Over-the-Counter 


12 T used keyword searches in proxies and 10-K filings for "chief f” “chief a," or "controller" and variants (e.g., 
“CFO,” “CAO,” and "finance") to identify the background information on these individuals. In general, the 
experience was reported for the past five years, as required by SEC Regulation S-K. For many companies, 
information was provided for more than five years. Inter-company variation in the number of years covered by 
these disclosures and the extent of detail about past experience introduces noise in the management quality 
variables, but such noise is likely to bias the coefficient estimates toward the null hypothesis. 

13 This variable may not completely capture the presence of an internal audit function because its construction 
relies on the proxies containing nonmandated disclosures about the internal audit function. 

^ For nine companies, I had to rely on the Disclosure database, but because my access to that database goes back 
only to 1988, the measure of tenure is truncated. However, the results remained qualitatively unchanged when 
I dropped these nine observations. 
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(OTC) markets; (3) The control sample was also chosen such that the company did not 
report any reportable events or disagreements. The final sample has 128 INTCONT com- 
panies and 128 NOPROB companies. 


IV. EMPIRICAL RESULTS 
Table 1, Panels A—C show the sample composition, by year, by industry, and by 
exchange. The sample ranges from 1994 through 2000, with most observations occurring 
in 1997 through 1999. The number of observations prior to 1997 is low because this was 


TABLE 1 
Sample Composition 


Panel A: Composition of Sample, By Year of Auditor Change 


Year of INTCONT NOPROB 
Change" Sample? Sample 
1994 6 2 
1995 14 10 
1996 13 21 
1997 28 27 
1998 20 31 
1999 31 27 
2000 46 .10 
Total 128 128 
Panel B: Composition of Sample, By Industry 
INTCONT® NOPROB* 
Industry (SIC) Sample Sample 
Mining and Construction (10—17) 3 | 3 
Manufacturing (20—39) 39 40 
Transportation and Utilities (40—49) 17 16 
Wholesale and Retail (50—59) 11 12 
Financial Services (60—69) 13 13 
Services (70—89) _45 _44 
Total 128 128 
Panel C: Composition of Sample, By Exchange 
INTCONT* NOPROB* 
Exchange Sample Sample 
NYSE 16 10 
AMEX 8 9 
OTC 104 109 
Total 128 128 


" Last fiscal year end of the predecessor auditor. 

b INTCONT sample consists of companies whose Forms 8-K reporting auditor changes contained disclosures 
relating to internal control problems. NOPROB sample is a control sample of companies that changed auditors 
but reported no internal control problems (or any other reportable event or accounting disagreements) in their 
Forms 8-K. l : 
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the phase-in period during which the SEC gradually inducted companies into EDGAR, 
starting with larger companies. The phase-in period ended in May 1996, after which all 
companies were required to make filings through EDGAR.” 

There are some industry-specific patterns in the Incidence of internal control problems. 
Services has the greatest proportion (35.2 percent) followed by Manufacturing (30.5 per- 
cent). Within Services (not reported in the table), the incidence of internal control problems 
is concentrated (20.3 percent) in one two-digit SIC industry (73), Business Services. Within 
Manufacturing, the incidence of internal control problems is concentrated (19.5 percent) in 
three two-digit SIC groups, 35 (Industrial Machinery and Equipment), 36 (Electronic and 
Other Electric Equipment), and 38 (Instruments and Related Products). Panel C of Table 1 
indicates that the sample companies are predominantlv over-the-counter (OTC) companies: 
there are 8 (9) companies from the American Stock Exchange (AMEX) in the INTCONT 
(NOPROB) group, and 16 (10) companies from the New York Stock Exchange (NYSE) in 
the INTCONT (NOPROB) group. !* 

Despite the attempts to match on industry and exchange, Table 1 indicates that the 
matching is less than perfect. Two companies (one in Transportation and one in Services) 
have matches in different industries. However, dropping these two observations and their 
corresponding matched observations does not change the results I report later. The imper- 
fection in matching is greater with respect to exchange affiliation. In particular, there were 
not a sufficient number of observations in some industry groups from the NYSE. Therefore, 
one observation from AMEX and five observations from OTC were chosen as matches for 
six NYSE observations. I include two dummy variables for exchange affiliation to control 
for the potential effect of these mismatches. 

Table 2 reports descriptive statistics. The univariate tests indicate that the two samples 
differ in audit committee characteristics. NOPROB companies are more likely to have audit 
committees of three or more members (ACSIZE), have a greater proportion of independent 
directors (ACINDEP) on the audit committee, and a larger number of members with finan- 
cial expertise (ACEXP) than INTCONT companies. NOPROB companies also have a larger 
proportion (not reported in the tables) of wholly independent audit committees (49.2 per- 
cent) compared to INTCONT companies (39.8 percent). About 45 percent (61 percent) of 
the INTCONT (NOPROB) sample (not tabulated) had at least one member with financial 
expertise. This difference is significant with p-value « 0.01. 

Two of the management quality variables, CFOEXP and FRAUD, are significantly 
different: INTCONT companies have a lower proportion of CFOs with previous experience 
in a similar capacity (CFOEXP), and are more likely to be associated with SEC action or 
restatements (FRAUD). INTCONT companies are financially more stressed, as measured 
by LOSS, have shorter auditor tenures (TENURE), and are more likely to have had IPOs. 


15 The industry and exchange matching procedure minimizes any selectivity bias that may arise from the inclusion 
of companies from the phase-in period. Also, models that were estimated after eliminating the observations from 
the phase-in period yielded results that were similar to the reported results. 

16 Further information for the OTC companies was not always available. Where information could be obtained, 58 
(50) of the INTCONT (NOPROB) companies were traded on the NASDAQ-National Market System, 16 (24) 
on the NASDAQ-Small-Cap Market, and 18 (16) on the Bulletin Board and Pink Sheets. For the remaining 
companies the information available from Forms 10-K or newspaper articles indicated only that it was traded 
either in NASDAQ (12 and 15 companies in the INTCONT and NOPROB samples, respectively) or in the Over- 
the-Counter market (4 companies in the NOPROB sample). 
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TABLE 2 
Descriptive Statistics for INTCONT and NOPROB Samples 
INTCONT* 
Expected Sample NOPROB* Sample 
Variable — Sign = Mean Median Mean Median t-statistic? Wilcoxon Z° 
ACSIZE = 0.547 1 0.641 1 —1.53* —1.52* 
ACINDEP = 0.681 0.670 0.771 0.815 —2.54*** —224** 
ACEXP -— 0.531 0 0.789 1 —2.9]*** "2. 72*9** 
ACCTEXP — 0.219 0 0.234 0 —0.30 —0.30 
CFOEXP = 0.508 ] 0.680 1 —2.83*** -2.]9*** 
FRAUD + 0.242 0 0.055 0 4.36*** 4.21*** 
ASHARE + 0.145 0.100 0.116 0.072 1.26 1.94** 
TENURE = 5.032 4 6.492 4 —2.07** —1.80** 
BIGS += 0.859 j| 0.773 1 1.78* 1.77* 
INTAUD ` = 0.188 0 0.211 0 —0.47 —0.47 
BDINDEP = 0.323 0.333 0.347 0.333 —0.70 —0.83 
LASSET +/— 4.148 3.962 3.787 3.600 1.55 1252 
HIGHGR +/— 0.516 1 0.492 0 0.37 0.37 
IPO + 0.430 0 0.352 0 1.28* 1.28* 
RECASST + 0.230 0.209 0.226 0.181 0.19 0.89 
PRBANK + 0.105 0.0002 0.099 0.00003 0.19 1.74** 
LOSS + 0.727 1 0.586 ] 2.30*** 236 49 
YEAR +/— 0.367 0 0.289 0 1.33 1.33 
NASD +/— 0.453 0 0.391 0 1.01 1.01 
NYAM +/— 0.188 0 0148 ` O 0.83 0.83 
RESIGN + 0.445 0 0.117 0 6.25*** 5.83*** 
n 128 128 | 


i 


*** ** * Significant at 1 percent, 5 percent, and 10 percent levels, respectively, one-tailed where signs are 
- predicted, two-tailed otherwise. 
* INTCONT sample includes companies whose Forms 8-K reporting auditor changes contained disclosures 
relating to internal control problems. NOPROB sample is a control sample of companies that changed auditors 
but reported no internal control problems (or any other reportable event or accounting disagreements) in their 
Forms 8-K. 
b Tests for significant differences in means. 
* Tests whether the observations in the two groups are from populations with different distribution. 
Variable Definitions: 
ACSIZE = 1 if audit committee has at least three members, and 0 otherwise; 
ACINDEP = proportion of independent outside members on the audit committee; 
ACEXP = number of audit committee members with financial expertise; 
ACCTEXP = 1 if the Chief Financial Officer (or Chief Accounting Officer or Controller) has CPA certification 
or previous experience in public accounting, and 0 otherwise; 
CFOEXP = 1 if the Chief Financial Officer (or Chief Accounting Officer or Controller) has previous 
experience in a similar capacity with another company, and 0 otherwise; 
FRAUD = | if there was an SEC action against the company’s management or the company restated 
financials due to GAAP violation, and 0 otherwise; 
ASHARE = auditor’s share in two-digit industry; 
TENURE = number of years the auditor has audited the client; 
BIGS = 1 if the predecessor auditor is Big 5, and 0 otherwise; 
INTAUD = 1 if internal audit function exists, and 0 otherwise; 
BDINDEP = proportion of independent outside members on the board, excluding the audit committee; 
LASSET = natural logarithm of total assets (in million); 


(continued on next page) 
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HIGHGR = 1 if growth rate in assets above the industry median, and 0 otherwise; 
IPO = 1 if the company had an IPO within five years of the last year of the predecessor auditor, and 0 
otherwise; 
RECASST = ratio of receivables to total assets; 
PRBANK - probability of bankruptcy using Zmijewski's (1984) model; 
LOSS = 1 if net income is negative, and 0 otherwise; 
YEAR - 1 if the observation is from 1999 or 2000, and 0 otherwise; | 
NASD = | if the stock was traded in National Market System of NASDAQ, and 0 otherwise; 
NYAM = 1 if the stock was traded in NYSE or AMEX, and 0 otherwise; and 
RESIGN = 1 if the auditor resigned, and 0 otherwise. 


INTCONT companies are more likely to have Big 5 auditors than NOPROB companies. 
Auditor resignations are also more likely for INTCONT companies.? Finally, the 
INTCONT sample has higher medians but not means for ASHARE, and PRBANK. 

Table 3 presents correlations. All correlations between the independent variables are 
below +/— 0.4. Further, investigation of variance inflation factors—all of which were under 
1.6—suggests multicollinearity is not a concern. 

Table 4 presents logistic regression estimates for the incidence of internal control prob- 
lems. The dependent variable, ICPROB is coded 1 or 0 according to the presence or absence 
of internal control problems. The model is significant, as indicated by its Chi-squared value. 
Two kinds of tests are used to examine the significance of coefficient estimates. Column 2 
presents p-values based on conventional logistic estimates. Column 3 presents the results 
of nonparametric randomization tests, which examine the generic hypothesis that a set of 
variables is unrelated to the dependent variable (Noreen 1989). The test is conducted by 
shuffling the dependent variable randomly across the observations and estimating the logit 
model. This process is repeated 1,000 times, generating a distribution of coefficients that 
can be compared with the original coefficients estimated with the unshuffled data. If the 
coefficient is predicted to be positive, then the significance of the coefficient (randomized 
p-value) is given by (VGE + 1)/(NS + 1) where NGE is the number of (shuffled) coeffi- 
cients that exceed the original coefficient, and NS is the number of times the shuffling is 
done. If the coefficient is predicted to be negative, NGE is defined as the number of times 
the (shuffled) coefficients is less than the original coefficient (DeFond and Jiambalvo 1991). 
The test statistic is not constructed for coefficients for which a sign is not predicted. 

A comparison of columns 2 and 3 in Table 4 indicates that the results of the random- 
ization tests closely parallel the conventional tests. The proportion of independent members 
on the audit committee (ACINDEP) and the number of expert members (ACEXP) are neg- 
atively associated with the likelihood of an internal control problem (p-value — 0.005 and 
0.014, respectively, in column 2, and p-values of 0.002 and 0.011, respectively, in column 
3). Two of the management quality proxies (CFOEXP and FRAUD) are strongly significant. 
Thus situations where the CFO (or equivalent officer) has had prior experience in a similar 


7 [f the Big 5 report internal control problems more often, then the difference in the means suggests that the 
NOPROB sample may have undetected or unreported internal control problems. This shortcoming in the data 
cannot be overcome, but would bias against my finding significant differences between the samples. Also, as I 
report later, Í estimate the models for the Big 5 alone, and find similar results. 

33 1 examined univariate differences among INTCONT and NOPROB companies within the resignation and dis- 
missal subsamples. There are two differences. The proportion of companies with three or more audit committee 
members (ACSIZE) is significantly higher for dismissals as compared to resignations for both the INTCONT 
and NOPROB companies. À greater proportion of INTCONT resignation companies have high growth 
(HIGHGR) as compared to INTCONT dismissal companies. 
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RESIGN 


Chi-Squared^ 


(p-value) 
Pseudo R? 
n 


Logistic Regression for the Incidence of Internal Control Problems* 


Expected 


_ Sign 


+/— 


Krishnan 
TABLE 4 
Coefficient Logistic Randomization 
Estimate p-value Test p-value 
_ ü _ (3) 
—0.074 0.927 — 
—0.095 0.387 0.378 
—1.467*** 0.005 0.002 
—0.487** 0.014 0.011 
0.091 0.404 0.599 
—0.917*** 0.003 0.005 
1.769*** 0.000 0.000 
0.363 0.351 0.351 
—0.055* 6.082 0.030 
0.036 6.939 — 
0.199 0.313 0.713 
—0.496 0.191 0.167 
0.230** 0.045 — 
—0.207 0.513 — 
0.076 0.418 0.401 
0.169 0.424 0.416 
0.447 0.233 0.190 
0.875*** 0.009 0.006 
0.207 0.568 — 
0.341 0.351 — 
0.297 0.562 — 
1.668*** 0.000 0.000 
88.567 
(0.000) 
0.250 
256 


*** ** + Significant at 1 percent, 5 percent, and 10 percent levels, respectively, one-tailed where signs are 
predicted, two-tailed otherwise. 


* Dependent Variable: ICPROB = 1 for INTCONT companies, and 0 for NOPROB companies. 


b The Chi-squared statistic and the Pseudo R? are for the logistic regression model. The coefficient estimates and 
the corresponding p-values for the logistic regression model are reported in columns 1 and 2. The p-values 
reported in column 3 are generated using Noreen's (1989) randomization procedure. Sec text for details. 

Sec Table 2 for definition of independent variables. 


capacity are less likely to be associated with internal control problems. Situations of po- 
tential fraud are characterized by a higher incidence of internal control problems.!? 


79 To test whether the effect of the audit committee variables varied with management quality and/or resigna- 
tion status, I also examined the effects of interactions between (1) the audit committee variables (ACEXP and 
ACINDEP) and the management quality variables (CFOEXP and FRAUD), and (2) the audit committee variables 
(ACEXP and ACINDEP) and RESIGN. None of the interactions was significant. The audit committee variables 
remained significant. Further, I examined the effect of the interactions between (3) the two management quality 
variables, CFOEXP and FRAUD, and (4) the management quality variables (CFOEXP and FRAUD) and RESIGN. 
Again, nonc of the interactions was significant. The results pertaining to audit committee variables remained 


unchanged. 
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Among the other contro] variables, financial distress (measured by LOSS) is positively 
associated with, and auditor tenure is negatively associated with, the likelihood of an in- 
ternal control problem. Consistent with the univariate results, firm size (LASSET) is posi- 
tively associated with the likelihood of internal control problems. This runs counter to the 
generally expressed view that larger firms have better controls, although as noted earlier 
there is no consistent evidence to support this view. It is possible that the expected asso- 
ciation between control quality and firm size does not hold for small and medium sized 
companies that comprise the current sample. Finally, as indicated in the univariate analysis, 
resignations (RESIGN) are positively associated with internal control problems. 


Severity of Internal Control Problem 


Next, I examine the severity of the internal control problems, that is, whether the 
problem is a reportable condition (REPCOND) or a material weakness (MATWEAK). Of 
the 128 companies in the INTCONT sample, 74 had material weaknesses relating to internal 
control. A multinomial logit model is presented in Table 5. The dependent variable, ICSEV, 
is coded 2 for a material weakness, 1 for a reportable condition, and 0 for no problems. A 
priori, one would expect a negative association between variables representing audit com- 
mittee effectiveness and the severity of the internal control problem. Columns 1—6 present 
the coefficients for the effects of the variables on the likelihood of a reportable condition 
(columns 1—3) and material weakness (columns 4—6) relative to the likelihood of there not 
being a problem. In column 7, I present the difference in the coefficients to test whether 
the coefficient is significantly different across the two levels. Because the power of these 
tests is generally lower than that of the logit (due to a smaller number of observations in 
subcategories of the data), I focus only on the results that are significant according to both 
the conventional t-test and the randomization tests. 

Results in columns 1-6 indicate that, with the exception of TENURE, which is signif- 
icant for reportable conditions but not for material weaknesses, the results in Table 4 carry 
over to both levels of internal control problems. ACINDEP and ACEXP are significant. 
CFOEXP and FRAUD are negatively and positively associated with both levels of internal 
control problems, respectively. Firms with losses are more likely to have both reportable 
conditions and material weaknesses relative to NOPROB companies. RESIGN is positively 
associated with both kinds of internal control problems.” 

Column 7 indicates that there are relatively few differences between the coefficients 
for the two kinds of problems. Audit committee characteristics do not affect the relative 
likelihood of a material weakness versus a reportable condition. Thus, while audit com- 
mittee independence and expertise lower the likelihood of an internal control problem, they 
do not affect the relative likelihood of one type of problem compared to the other. Three 
factors are more likely to be related to material weakness than to reportable conditions: 
FRAUD, BIG5, and PRBANK. Thus, the severity of the internal control problem is greater, 
not surprisingly, in situations of subsequent fraud, and greater financial distress. In addition, 
Big 5 firms are more likely to be associated with material weaknesses than with reportable 


20 The definition of financial expertise used in this paper is narrower than the BRC definition, but similar to the 
SEC's (1999) definition. For the BRC, a financial expert also includes “being or having been a CEO or other 
senior officer with financial oversight responsibilities" (BRC 1999, 25). As a result, the incidence of financial 
expertise in my sample is lower than that in studies that use the BRC's definition (e.g., Carcello and Neal 2003). 
As a sensitivity exercise, I re-estimated the models using the BRC definition. ACEXP is negative and significant 
in the logit, and negative and significant for material weaknesses and insignificant for reportable conditions in 
the multinomial logit. 
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conditions. This could reflect differences in reporting or differences in clientele across Big 
5 and non-Big 5 firms. 


V. ADDITIONAL TESTS 
Alternative Audit Committee Quality Measures 


I examined other measures of audit committee effectiveness. First, I classified the audit 
committee as being of high quality (JNDEP) if all members are independent (Klein 2002b). 
Second, I classified an audit committee as being of high quality (J[NDEP-EXP) if it has all 
independent members and at least one member with financial expertise (Raghunandan et 
al. 2001). Logistic regressions (Panel A, columns 1-2) and multinomial logits (Panel B, 
columns 1—4) using these two alternatives are presented in Table 6. The results for the 
other variables are similar in all cases and therefore not reported in this table. Both panels 
indicate that INDEP and INDEP-EXP are significant and negative. 


Auditor Dismissals 


Tables 4 and 5 indicated that auditor resignations are positively associated with internal 
control problems. As a sensitivity test, I also estimated the models using the sample of 
dismissals alone. I retained only pairs of observations, that is, INTCONT companies for 
which the matching NOPROB companies were also dismissal companies. The logistic and 
multinomial logit results are presented in Table 6, Panel A (column 3) and Panel B (columns 
5—6). Panel A indicates, consistent with Table 4, both ACINDEP and ACEXP are negative 
and significant, suggesting that the type of auditor change may not be driving the full 
sample results. In Panel B, ACINDEP is significant for both reportable conditions and 
material weaknesses and ACEXP is significant for reportable conditions but not for material 
weaknesses. 


Auditor Type 

As noted, the sample consists predominantly of Big 5 clients. To check that the results 
are not driven by the few non-Big 5 clients, I also estimated the models for the sample of 
Big 5 clients only (once again retaining only matched pairs). The results are reported in 
Table 6, Panels A (column 4) and B (columns 7—8). The results are generally consistent 
with Tables 4 and 5. ACINDEP is significant and negative in both panels. ACEXP is sig- 
nificant in Panel A and significant for material weaknesses in Panel B. 


Additional Audit Committee Characteristics 


Next, I extend the measures of audit committee effectiveness to audit committee char- 
acteristics other than independence and expertise. I add three variables suggested in prior 
work as additional measures of audit committee effectiveness: number of meetings (Abbott 
and Parker 2000), proportion of stock owned by audit committee members, and the average 
number of directorships held by the audit committee members in other companies (Carcello 
and Neal 2003). 

The expected signs for all three variables are ambiguous. The number of meetings may 
reflect how proactive the audit committee is in assessing internal control, but it may also 
be a response to the emergence of problems. Stock ownership by directors can have both 
positive and negative effects on director behavior. The positive effect, derived from agency 
theory, suggests ownership provides incentives to take actions to protect one's investment, 
including those that ensure proper financial reporting. The negative effect is the incentive 
to manipulate stock prices through inaccurate reporting. The number of directorships held 
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by audit committee members on the boards of other corporations is one element of gov- 
ernance expertise. On the one hand, the greater the number of outside directorships, the 
more the audit committee member's investment in obtaining governance expertise, and 
therefore more likely s/he will engage in effective monitoring activities. On the other hand, 
an increasing number of outside directorships can cause attention to be spread out thinly 
over the different assignments, thus lowering efficacy in any one firm. 

The three variables are not significant in the logistic or multinomial logits (results not 
tabulated), perhaps reflecting the fact that both positive and negative forces are at work. 
Other results remain unchanged. 


VI. DISCUSSION AND CONCLUSIONS 

Recent policy initiatives have focused on (1) improving corporate internal control, and 
(2) emphasizing more formally than before the role of the audit committee in maintaining 
effective control. An example of the first kind of policy initiative is the requirement that 
auditors attest to management's assertions about internal control (SEC 2003a).*! Examples 
of the second kind of policy are (1) the recent mandate that all auditor communication on 
internal control be specifically provided to the audit committee (SEC 2003b) and (2) the 
new SEC definition of an "audit committee expert," which includes familiarity with internal 
control issues as a component (SEC 2003c). Policy makers and the profession have always 
maintained that the audit committee should adopt the monitoring of control as a primary 
function. Yet, because internal control quality is not observable, little is known about how 
entities achieve high-quality internal control (Kinney 2000), and whether audit committees 
play a role (Cohen et al. 2002). 

In this paper, I provide evidence of (1) a negative association between the presence of 
internal control problems and audit committee independence and (2) a negative association 
between the presence of internal control problems and the number of audit committee 
members with financial expertise.” In addition, I find four other factors to be consistently 
associated with the incidence of internal control problems: managers' (CFO, CAO, or Con- 
troller) past on-the-job experience, management's tendency to engage in fraud, auditor 
tenure, and financial stress. Variables representing other monitors, such as the internal audit 
function, the external auditor, and the board of directors are not consistently related to the 
quality of internal control. When I partition the internal control problems into less severe 
(reportable conditions) and more severe (material weaknesses) problems, I find that the 
audit committee associations reported above hold for both levels of problems in the basic 
model. In sensitivity analyses, the negative significance of audit committee independence 
persists in all cases. Audit committee expertise is significant for at least one of the two 
levels of internal control problems in all sensitivity exercises. 

The results are consistent with the current policy emphasis on the independence and 
expertise of audit committee members. The adequacy of both requirements bas recently 
come under criticism (Lublin 2002; Whitehead 2002). The results pertaining to expertise 
are particularly interesting because a substantial proportion (55 percent and 39 percent of 
internal control and no-problem firms, respectively) of my sample did not have any member 
with expertise. The 1999 exchange rules (effective June 2001) can be expected to bring 


21 So far, management reports on internal control have been voluntary rather than mandated (Hermanson 2000). 

72 Because the sample time period precedes the effective dates for recent SEC policies regarding audit committee 
composition, there is enough variation in the data to allow inferences about the association between audit 
committee composition and internal control quality. 
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changes in audit committee composition as a result of companies complying with the re- 
quirement to appoint at least one financial expert to the audit committee. Moreover, the 
SEC's new definition of audit committee expertise includes as noted, “experience with 
internal controls and procedures for financial reporting” (SEC 2003c). Future studies can 
examine whether the enhancement to the expertise definition results in a closer association 
between audit committee quality and internal control quality. 

A shortcoming of this research, also noted by previous archival research in this area 
(e.g., Carcello and Neal 2003), is that it is a study of association, not causation. It is 
vulnerable to the possibility that unobserved factors that are correlated with both internal 
control and audit committee characteristics may drive the reported findings. To minimize 
this possibility, I use control variables representing other monitors of internal control, in- 
cluding the internal audit function, the external auditor, the board of directors, and the 
quality of management, as well as a number of variables (such as financial stress and 
growth) that may be related to the quality of internal control and the quality of the audit 
committee. The finding that the audit committee characteristics are associated with internal 
control after controlling for other monitors, suggests the audit committee has a contribution 
over and above the other monitors. 

A second shortcoming is that the sample consists of companies that changed auditors. 
Because reported internal control problems are only made public in situations of auditor 
change, the sample choice was unavoidable. Auditor change companies tend to be smaller 
in size than the general population of companies and are traded on the smaller stock ex- 
changes. This may affect the generalizability of the results of this study. However, it is 
generally small firms and firms on the NASDAQ and OTC exchanges that are vulnerable 
to fraudulent reporting, earnings restatements, and SEC action (Dechow et al. 1996; Bonner 
et al. 1998). To the extent that weak internal control contributes to these negative outcomes, 
this study focuses on the potential role of the monitoring provided by audit committees in 
avoiding such outcomes. My results are consistent with prior work that also uses restricted 
settings to test the effect of audit committees—for example, financially stressed firms 
(Carcello and Neal 2000) or auditor change firms (Carcello and Neal 2003)—as well as 
with Klein (2002b), which is based on a more general population of firms. 

Under the new SEC rules (SEC 20032), auditors will issue attestation reports on their 
clients' internal control. Because these reports must be filed by all filers and not just those 
changing auditors, future studies can examine the role of audit committee composition and 
other factors in determining the quality of internal control for a more general population 
of firms. 
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ABSTRACT: Thls paper examines how partners in an audit firm can use profit-sharing 
rules to induce optimal partner behavior from the firm’s point of view, taking into ac- 
count the strategic competition of firms in an auditing oligopoly. We use a linear con- 
tracting framework to investigate the effects of profit-sharing rules on individual part- 
ners' various decisions, including thelr pricing strategies and effort choices. We assume 
that efficient audits of different types of clients require different effort profiles with re- 
spect to degree of partner cooperation. For example, the audit of a complex company 
requires different amounts of partner collaboration than does the audit of a simple 
company. Moreover, since it is too costly for an enforcement party, such as the head 
office of an audit firm or a court, to verify each client's type in order to resolve com- 
pensation disputes among the firm's partners, it is reasonable to assume that client 
type cannot be contracted upon for partner compensation purposes. Given this as- 
sumption, we derive conditions under which there exists an equilibrium in which audit 
firms strategically choose different profit-sharing rules to specialize in different types 
of clients, thereby earning positive economic profits. Our analysis provides Insights into 
the strategic competition among the big audit firms, and helps to explain the observed 
differences in the compensation plans of these firms and in the nature of their client 
portfolios. 


Keywords: audit partnership; profit-sharing rules; oligopoly; specialization. 


I. INTRODUCTION 
n this study, we investigate incentives and profit-sharing rules in audit firms and their 
implications for the audit services market. Existing literature normally treats an audit 
firm as a single person "auditor," and pays limited attention to the decentralized nature 
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of actual firms in which individual auditors operate as partners.! Like other rational players 
in the economy, audit partners make decisions to maximize their own economic interests, 
which are influenced by the reward and control systems of the partnership. Hence, under- 
standing the profit-sharing rules of audit firms is crucial to understanding auditors’ behav- 
ior.^ As Zeff (2003) has commented, the manner in which audit firms reward their partners 
is a fundamental issue facing the accounting profession. 

Within an audit firm, partners make decentralized decisions in performing various 
tasks—including bidding for clients, expending effort in performing audits, and assisting 
one another in the performance of audits. A moral hazard problem (Holmstrom 1982) arises 
when an individual partner takes actions that are inconsistent with the interests of the 
partnership as a whole. For example, if the legal liability costs associated with an audit are 
shared equally among all partners in the firm, then the engagement partner's effort level 
can be expected to fall short of the optimal level from the firm's perspective, because the 
engagement partner does not fully bear the consequences of his effort choice. A principal 
objective of this study is to examine how partners in an audit firm can use profit-sharing 
rules that base an individual partner’s compensation on his performance—as reflected by 
audit fee revenues and liability costs from lawsuits—to induce optimal partner behavior 
from the firm's point of view. 

Audit firms are normally divided into two groups based on brand name or firm size: 
the Big 4 versus the non-Big 4, with the Big 4 firms dominating the audit services market.? 
For example, 73 percent of the 7,006 publicly traded companies included in Who Audits 
America (2003) were audited by the Big 4, and the rest were audited by 81 non-Big 4 
firms. With just a few firms dominating the market, it is reasonable to expect these firms 
to act as strategic competitors. In this study, we examine how audit firms can use their 
profit-sharing rules as strategic instruments to compete in an oligopolistic audit market. 

A number of studies have documented differences in the profit-sharing rules used by 
the big audit firms. For example, based on survey responses in the late 1980s and early 
1990s, Trompeter (1988, 1994) classifies the compensation plans of the then Big 8 (6) firms 
into "small pool" and "large pool" plans, according to the importance of “local office 
profitability" in determining an individual partner's compensation.* He reports: 


For Local Office Profitability, there is an interesting dichotomy in the [partners'] re- 
sponses. Office managing partners from three of the firms deemed this component to 
be unimportant in determining partner compensation. These firms base partner com- 
pensation on firm-wide profitability as opposed to office-wide profitability. Partners 
from the other firms rated this factor very important. (Trompeter 1988, 85) 


Using the “large-pool” versus "small-pool" classification, Trompeter (1994) and 
Carcello et al. (2000) investigate the effects of different profit-sharing arrangements on 
partners” reporting decisions, hypothesizing that partners from “small pool" firms are more 


! An exception is Narayanan (1995), but he treats profit-sharing rules as exogenous in the model. 

The terms "partner" and "auditor" are used interchangeably in this paper. 

The Big 4 are Deloitte Touche Tohmatsu, Ernst & Young, KPMG, and PricewaterhouseCoopers. The number 

of big audit firms shrank from eight to five through mergers in the 1980s and 1990s, and to four after the 

elimination of Arthur Andersen from the group as a result of the Enron debacle in 2001. 

* Burrows and Black (1998) provide similar observations. In particular, they report that the partners in the Mel- 
bourne office of Arthur Andersen (KPMG, respectively) share in the world wide profit pool (share only in the 
Melbourne office profit pool). 


w N 


The Accounting Review, April 2005 


Profit Sharing in an Auditing Oligopoly 679 


likely to compromise their objectivity and independence than are those from “large pool" 
firms. They find some evidence consistent with this prediction. 

While interesting, these studies do not consider certain fundamental questions: Why do 
the big audit firms use different compensation schemes? How do these differences affect 
partners' decisions in performing multiple tasks and the outcome of competition? By ad- 
dressing these questions, we seek to shed light on the strategic competition among the big 
audit firms and to provide an explanation for the observed differences in the compensation 
plans of these firms. 

In our analysis, we use a stylized model consisting of two audit firms, with two auditors 
(namely partners) in each firm, operating separately in offices located in two geographically 
different places. Audit production is viewed as a two-stage process. In the first stage, 
auditors from different firms in each location compete in bidding for clients. In the second 
stage, auditors in each firm cooperate in performing the audits for the clients obtained in 
the first stage. 

We assume that the efficient performance of an audit requires some degree of teamwork 
by partners (from different offices) within a firm. This assumption is consistent with the 
notion that the economic benefits of teamwork are the main reason for auditors to join one 
another in forming a partnership. In fact, one can reasonably argue that the big firms exist 
as multinational partnerships in order to facilitate collaboration among their offices (and 
partners) so as to more efficiently service multinational client companies? The potential 
benefits of collaboration may also occur when a client company operates in multiple in- 
dustry sectors. Ferguson et al. (2003) report evidence that industry specialization is a char- 
acteristic of specific big firm offices and partners—not the firm as a whole. Thus, the 
collaboration of industry-specialist firm offices (partners) may be needed to efficiently audit 
a client that operates in multiple industry sectors. 

While partner cooperation can be useful in many contexts, the amount of partner co- 
operation needed for an audit will likely vary across clients. Client companies differ greatly 
in size, geographic dispersion and complexity of their operations, extent of their operations 
in multiple industries, and other characteristics that can influence the degree of partner 
collaboration needed to produce an audit efficiently. For example, the appropriate level of 
partner collaboration may depend on the degree of industry-specific knowledge required 
for the audit, and the audit of a large international company probably requires more partner 
cooperation than the audit of a small local company. In our analysis, we capture such 
differences among audit clients by assuming that clients can be classified into two types: 
complex clients and simple clients. 

Turning to results, we find that, given that an efficient audit requires some degree of 
teamwork, it is optimal for partners within a firm not only to share the legal liability costs 
of audits, but also to share engagement revenues (i.e., audit fee revenues). Liability sharing 
between partners is essential for the motivation of teamwork because an auditor is willing 
to provide consultation/help to his partner if and only if he bears some responsibility for 
the audit failure of his partner's clients. To understand the need for revenue sharing, we 
observe that a partner would price below the firm's cost in bidding for clients if he received 
the whole amount of his engagement revenues. This occurs because the partner's personal 
cost of an audit is less than the firm's cost (since he bears only a part of the legal liability 
cost of the audit and ignores his colleague's cooperation/helping effort). Consequently, 


5 See, e.g., Cypert (1991), who gives an account of how the benefits of cross-justification cooperation of audit 
partners drove the mergers that formed KPMG. 
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partners should share each other's engagement revenues to counteract their incentives to 
bid below the firm's cost. 

Moreover, we find that firms' equilibrium compensation strategies differ significantly, 
depending on whether client type is contractable or is not contractable. When client type 
is contractable, firms have the same dominant strategy, consisting of client-type-specific 
sharing rules, which induce optimal partner behavior (in terms of effort profiles and pricing 
. policies) for each type of client. This is a perfect competition equilibrium in which auditors 
from different firms provide audits of the same value, bid the same price, and earn zero 
economic profit. 

However, we believe that, in the real world, client type is unlikely to be contractable 
because a client's operating characteristics that determine its "type" (i.e., complex versus 
simple and the resulting effort mix for an efficient audit) are generally difficult to verify. 
This seeming contracting disadvantage may, however, create an economic advantage for 
audit firms. We find that, when client type is not contractable and under suitable conditions, 
there exists a specializing equilibrium in which firms strategically use different compen- 
sation schemes to induce efficient audits for different types of clients, thereby gaining 
limited monopoly power over the clients in which they specialize. In equilibrium, each firm 
provides higher value audits for clients in its niche market than does the rival firm, and 
earns positive economic profits. These equilibrium results can explain the existence of both 

“large profit pool” and “small profit pool" big firms observed by Trompeter (1988, 1994) 
and can also help explain client-auditor alignments. For example, our theory.is consistent 
with the observation that the client portfolio of Arthur Andersen, a former “large pool 
firm," was concentrated in large, complex companies, while KPMG, a “small pool firm," 
was concentrated in smaller, simpler clients (U.S. Senate 1977; Liu 2002). We provide 
more discussion of these observations and their links to our analytical results in Section V. 

The remainder of the paper is organized as follows: Section II presents the model; 
Sections III and IV derive the equilibrium under the assumptions that client type is con- 
tractable and not contractable, respectively; Section V concludes the paper. A table of 
notation and all proofs are given in the Appendix. 


II. THE MODEL 
The Shareholders? Problem 


Consider a one-period model with two geographically different locations and two audit 
firms. All parties in the model are assumed to be risk-neutral and the interest rate is nor- 
malized to zero. Each audit firm has two partners operating separately in the two locations. 
This assumption captures a real-world situation in which an audit firm establishes offices 
in different states (and countries) with partners operating separately in different offices. A 
partner is assumed not to solicit clients outside his jurisdiction (namely location). We label 
the two locations 1 and 2, and the two firms A and B, so that Auditor Al, for example, 
refers to the partner of Firm À who operates in Location 1 (see Figure 1). The number 
of companies available for audit in each location is denoted by a positive integer n, i 
e {1,2}. 


$ We distinguish between the terms "accounting profit" and "economic profit." Accounting profit is the difference 
between the audit fee revenues and the liability costs, whereas economic profit equals accounting profit minus 
the auditors' effort costs. Unlike accounting profit, economic profit is not verifiable because only individual 
auditors have private information about their effort choices. In this paper, the term “profit” refers to accounting 
profit, unless otherwise noted. 
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FIGURE 1 
Auditors from Firms À and B in Two Geographically Different Locations 


Partner Al Partner A2 
Partner B1 Partner B2 
Location 1 Location 2 


At the beginning of the period, shareholders decide whether to continue investing in 
or liquidate their companies. A priori, a company is either a good company with probability 
$ € (0, 1) or a bad company with probability (1 — $), depending on its financial condition, 
which is not known to the company's shareholders. A good company is worth V > 0 if 
liquidated now, or returns V' > V at the end of the period if the investment is continued, 
whereas a bad company is worth J > 0 if liquidated now, or returns zero if the investment 
is continued. We assume that the expected return from continuing investment in a company 
is greater than that from immediate liquidation of the company, i.e.: 


ov > pV + (1 — oD. (1) 


Thus, shareholders will continue investing in a company based on their prior beliefs, and 
their (expected) payoff from the company is: 


oV. (2) 


The shareholders may engage an audit firm to perform an audit of the company's 
financial condition. To focus on the audit firm's problem, as is common in the literature 
(e.g., Dye 1995; Schwartz 1997), we assume that the audit firm is the sole preparer of an 
attestation report on the company's financial condition.’ Once the audit firm is engaged, 
the shareholders pay an audit fee, which is determined before the audit is performed (i.e., 
a fixed audit fee contract).? 

As discussed in Section I, we assume that the performance of an audit requires some 
degree of teamwork in the sense that the engagement partner's (that is, the audit firm's) 
attestation report is the outcome of partners' joint effort inputs in the audit process. We 
assume that a partner's effort can be measured in terms of personal costs, and that the 
partner's effort is additively separable across tasks and clients. Formally, for each client, 
let e € [0, e] be the cost of the audit effort by the engagement auditor (partner), and 


f In the real world, the role of an auditor is to render an opinion on whether the financial reports prepared by the 
company's manager fairly present the financial condition of the company. To simplify the analysis and to focus 
on the audit firm's problem, in the model we do not explicitly consider the agency problem and contracting 
issues between shareholders and managers, and assume that auditors prepare the financial 

° This assumption is consistent with most real-world situations. For example, a 2002 survey (1999 survey, re- 
spectively) by the Committee on Corporate Reporting found that 21 (30) out of the 25 (31) big audit firms’ 
clients surveyed engaged their auditors with annual fixed fee contracts (see http:/ /www.fei.org/ download / 
2002. Audit. Fee.xls). 
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h € [0, h] the cost of the cooperation/helping effort by his partner. The audit technology 
is characterized by the probabilities of the engagement auditor's correct attestation report 
conditional on the company's real financial condition and the partners' joint effort inputs 
as follows: 


Pr[g|Good, e, h] = 1 and Pr[b|Bad, e, h] = ule, h) (3) 


where g (b) denotes the engagement auditor's report that the company is good (bad). That 
is, we assume that there is no Type I error in the sense that the engagement auditor will 
not issue a “b” report on a good company; on the other hand, the auditor may issue a “g” 
report on a bad company? With such a specification of audit technology, we represent the 
audit quality by p, namely the conditional probability of detecting a bad company. 

As discussed in Section I, the audits of different types of client companies require 
different amounts of partner collaboration, and client types are determined by companies' 
operating characteristics. To model this situation in a tractable manner, we assume two 
types of clients, complex and simple, and denote them by two real numbers C and 5, 
respectively, where C < S. We let the audit quality function ,(-) be parameterized by a real 
number x € (x, x), and let the set of client types be a subset of the parameter set. That is: 


u: [0, e] x [0, h] x (, x) — [0, 1] and (C, S) C @, 3). (4) 


We represent the numbers of different types of clients in each location by positive integers 
ni, where i € (1, 2} and t € [C, S}. Note that nf + n? = n, 

We assume that the audit quality function, u(e, h; x), is continuously differentiable and 
has the following properties on its domain, with the subscripts e, h, and x denoting 
derivatives: 


Assumption 1: p, = 0 if e = 0; p = 1 if, and only if, (e, h) = (e, h). 
Assumption 2: p, > 0; p, = 0, with strict inequality if e > 0. 
Assumption 3: p... = 0; wp < 0, with strict inequality if e > 0. 


Assumption 4: i, < 0 and p, p — pn pu, > O for any (e, h) in the set (0, e) 
x (0, h). 


Assumption 5: p, > 0. 


The first part of Assumption 1 means that the engagement auditor's effort is essential, 
in that the probability of detecting a bad company is zero if the engagement auditor puts 
forth zero effort (regardless of his partner's cooperative effort). The second part of As- 
sumption 1 means that a perfect audit is conducted (i.e., the probability of detection is 1) 
if and only if both partners' effort levels attain the upper bounds. Assumption 2 means that 


? These assumptions of audit technology are reasonable since they translate into the real-world situations that (1) 
an auditor will not find any misstatement in the manager’s financial report when there is no misstatement and 
(2) an auditor may fail to detect material misstatements in the manager's financial report when the report is 
misstated. These assumptions of audit technology are also common in the literature; see, e.g., Dye (1995) and 
Pae and Yoo (2001). | 


> 
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audit quality increases with partners’ effort.'? Assumption 3 implies that the marginal benefit 
of engagement audit effort e (cooperation effort h) increases (decreases) with client type. 
Assumption 4 implies that i: is a strictly concave function of effort pair (e, h), which means 
that the marginal benefit of a person's effort is decreasing with his own effort. Assumption 
5 states that the marginal benefit of a person's effort is increasing with his partner's effort; 
i.e., the partners’ effort inputs are complements in the audit process. By way of an example, 
note that these assumptions are satisfied by the following class of functions: u(e, h; x, p) 
= xel/? + (1 — xY(eh)'^*, where e € [0, 1], h € [0, 1], x € (0, 1), and p > 2. 

With the audit technology specified in Equation (3), it is clear that audit failure (con- 
ditional on a company being bad) occurs with probability (1 — p). For simplicity, we 
consider a strict liability regime in which the audit firm is liable whenever shareholders 
suffer losses as a result of audit failure. That is, when audit failure occurs and a bad 
company continues to operate, the shareholders suffer financial losses—which equal their 
investments in the bad company, I, in the context of this model—and get legal payments, 
denoted by / > 0, from the audit firm through litigation. 

-We now examine the shareholders’ decision problem. If the audit firm issues a “b” 
report, then the shareholders will liquidate the company since there is no Type I error (.e., 
no incorrect “b” report). On the other hand, if the audit firm issues a “‘g” report, then the 
shareholders will continue their investments in the company. The reason is as follows. Given 
a “g” report, the. shareholders’ payoff from continuation of their investments is: 


Pr[Good|g]V’ + Pr[Bad|g]l, (5) 


where Pr[Good|g] = wees and Pr[Badle] = 1 — Pr[Good|g]. In contrast, 


given a “g” report, the shareholders’ payoff from liquidation is: 
Pr[Good|g]V + Pr[Badle]/. (6) 


By comparing Expressions (5) and (6), we know that shareholders given a “g” report get 
a higher payoff from continuation than from liquidation, because: 


pV + (1 — o1 — pil 
Good|g]V' + Pr[Badlg]] = ————————————— 
Pr[Good|g]V' + Pr[Bad|g] $3 0-123 


_ $V + Q — $I + ( — &XI - ii 
$ + (1 — 6) — 1) 
_ $V + 0 — 901 - p)! 
$ + (1 — 0-5 
= Pr[Good|g]V + Pr{Bad|gl, 


'? Assumptions 1 and 2 are similar to those in Itoh (1991), who models mutual help between agents in a principal- 
agent setting. ; 

!! We considered a negligence liability regime in an earlier version of the paper, but that merely complicates the 
notation without adding any new insights. See Schwartz (1997) for a discussion of different legal regimes and 
their implications. 
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where the first inequality comes from substitution from Inequality (1). Based on the share- 
holders' optimal continuation/liquidation decisions conditional on different audit reports, 
the shareholders’ ex ante payoff from purchasing an audit is: 


Pr[gI(Pr[Good|g]V' + Pr[Bad|g]/) + Pr[b]Pr[Bad|bY — f 
= $V' + (1 — 6X1 — ui + (1 — oui — f, (7) 


where f denotes the audit fee. 

Taking the difference between the shareholders' ex ante payoffs with and without an 
audit, as stated in Equations (7) and (2), the shareholder surplus (i.e., the shareholders’ net 
benefit) from purchasing an audit is: 


(1 — b) + (1 — 9X1 — p)! — f, (8) 


which represents the expected savings of the wasteful investments in a bad company plus 
the expected legal payments from the audit firm through litigation, less the audit fee. The 
shareholders’ problem in choosing an audit firm is to maximize the shareholder surplus. 


The Audit Firm’s Problem 


Since audit technology depends on client type, as shown in Condition (4), an audit firm 
may wish to vary its compensation structure according to client type in order to induce 
different partner behavior suitable for different types of clients. However, realistically, it 
would be too costly, if not impossible, for an enforcement party (such as the head office 
of the firm or a court) to verify each client's type in order to settle compensation disputes 
among the firm's partners. This is because the client's operating characteristics, which 
determine both its type and the necessary amount of partner cooperation for an efficient 
audit, are generally difficult to verify. Even if some of the information associated with client 
type is verifiable—such as the number of subsidiaries a client has in different locations, or 
the number of industry sectors in which it operates—this information may not perfectly 
classify a client. To see this, consider a case in which a company has many subsidiaries 
located in different countries. If the affiliated companies engage in many intercompany 
transactions, then they are likely to be complex clients because a significant degree of 
cross-jurisdiction partner consultation may be needed to address difficult auditing issues 
arising from transfer-pricing, tax-planning, and related-party transactions. However, if the 
affiliated companies' operations are relatively independent, then they are likely to be simple 
clients, and their audits may not require much partner collaboration. 

The emphasis of our analvsis, then, is not on the case in which client type is contract- 
able, but on the more realistic case in which client type is not contractable. Nevertheless, 
since the analysis of the latter can be built on the former, we analyze both cases in order 
to better illustrate their differences. 


III. CLIENT TYPE CONTRACTABLE 
Profit-Sharing Rules 
An audit firm's profit-sharing rules consist of revenue-sharing and liability-sharing 
rules, which are not necessarily the same. For tractability of the analysis, the sharing rules 
are assumed to be linear: 
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y= > R+ DS (1-o)R!- Dd PLU- > (1-BIL +n), (9) 
(€ {CS} re {CS} t€ (C,S) 


t€ (C.S) 
where: 


y, = Partner Ps compensation; 
i, J = partners in the firm, ij, i, j € (1, 2}; 
t = the client's type, t € (C, S}; 
R: > 0 = audit fee revenues related to the z type clients audited by Partner i; 
Li = 0 = legal liability costs related to the ¢ type clients audited by Partner i; 
ai = finite incentive weight on revenues related to t type clients audited by Partner i; 
B: = finite incentive weight on legal liability costs related to t type clients audited by 
Partner i; and 
ny = lump sum transfer between partners, niz = —Tər 


Accordingly, the revenue-sharing (liability-sharing) rules, characterized by = (B), specify 
how the partners share each other's engagement revenues (legal liability costs) within the 
firm. 


Timeline for the Auditors’ Game 


As depicted in Figure 2, the timeline for the auditors’ game is as follows: At Stage 0, 
in each firm, partners choose their profit-sharing rules. We assume that a partner agrees to 
participate in the game if his expected payoff given the profit-sharing rules is non-negative. 
At Stage 1, in each location, auditors from different firms submit bids, under the names of 
their firms, to shareholders in their jurisdiction. For each client, the auditor who provides 
shareholders with the higher surplus wins the bid. At Stage 2, in each firm, partners choose 
effort levels in performing audits and in helping each other in the performance of audits. 
Finally, companies’ returns are realized and thus the companies’ natures (i.e., whether they 
are good or bad companies) are revealed; legal payments are made by audit firms to share- 
holders if shareholders suffer losses as a result of audit failure; partners are compensated 
according to the sharing rules specified at Stage 0. As noted earlier (see footnote 2), the 
terms “partner” and “auditor” are used interchangeably in this paper, although sometimes 
we use one term over the other in order to emphasize one of the person’s dual roles: an 
auditor, who performs audits for shareholders, is also a partner who provides cooperation/ 
helping effort to his colleague and makes various decisions within the audit firm. 


FIGURE 2 
Timeline for the Auditors’ Game 


4 i | 


Stage 0: Stage 1: Stage 2: Finally, payoffs 
Sharing rules are Auditors in each Auditors in each are realized and 
determined in location bid for firm cooperate in the game ends. 
each firm. audit engagements. performing audits. 
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This is a three-stage game with observed actions.!2 In particular, we assume that the 
firms' sharing rules become observable to all the players before the start of Stage 1. We 
believe this assumption is a reasonable description of the real world. For example, Burrows 
and Black (1998, 522) interviewed partners from the Melbourne offices of the then Big 6 
firms and report that "despite the dearth of published information on the topic, each partner 
knew in general how rival firms shared profits." ? This assumption is also commonplace in 
the literature on strategic delegation in oligopoly firms (e.g., Fershtman et al. 1991; Reitman 
1993; Aggarwal and Samwick 1999). As Fershtman and Judd (1987) argue, firms' com- 
pensation contracts are infrequently altered and repeated play of the game should enable 
interested parties to learn about these contracts. We therefore solve the game for a subgame 
perfect equilibrium through backward induction. 


Stage 2’s Problem: Effort Choice in the Performance of Audits 


The effort choice subgame in Stage 2 consists of two independent, simultaneous games, 
each played by partners within an audit firm. Given symmetry, it suffices to analyze the 
problem of just one firm. Within the firm, for an arbitrarily chosen client, each partner's 
Objective is to choose an effort level to minimize his personal costs (comprised of his effort 
cost and his share of the expected legal liability costs associated with the audit), treating 
sharing rules and his belief about his partner's effort choice as given. Formally, with 
w.(e, h; t) written more concisely as u'(e, h), the engagement auditor's decision problem is: 


min Bj (1 — 6X1 — pe, h))I + e, (10) 
ecioz] 


and his partner's decision problem for this client is: 


min (1 - BI — &X1 — wie, D) + A, (11) 


REIO, h 


where í € (1, 2) and t € (C, S). Observe in Objective Functions (10) and (11) that the 
revenue-sharing rule, a, does not enter into partners’ effort decisions. This is because, given 
fixed-audit-fee contracts, the revenues a partner receives are fixed when he makes effort 
decisions. 

We restrict the liability-sharing rule, B, to non-negative values, since the firm can always 
replace a negative value for B with zero to induce the same effort profile (that is, zero effort 
in the audit). Furthermore, we impose the following technical assumption to ensure an 
interior solution: 


Assumption 6:lim p, — o, lim p, — o provided e > 0; (1 — pul < 1, 
e—0 h—0 


(1 — )p,(A)l < 1. 


Proposition 1:In an audit firm, given the liability-sharing rules characterized by 
(Bç, B$, BS, BS}, a partner chooses the same unique level of ef- 
fort—audit effort e or cooperation/helping effort h—for every client of 
the same type. Specifically, if B; = 0, then e = h” = 0, where i = j, 


' The assumption that partners observe audit fees before choosing effort is not essential because, given fixed audit 
fee contracts, effort choices do not depend on the audit fees set in Stage 1. 

15 Our interviews with the partners from Arthur Andersen and KPMG also suggest that audit partners are knowl- 
edgeable about the rival firms’ compensation schemes. 
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i,j € {1, 2}, t € (C, S). If B: = 1, then e € (0, e] and A7 = 0. If 
B; € (0, 1), then e” € (0, e) and b € (0, h), and (ej, hj) are deter- 
mined by the following conditions: 


BKI — ó)u;(e, hl — 1 = 0, (12) 
(1 — BDC — pele, hl — 1 = 0. (13) 


Proposition 1 shows that the equilibrium effort profile for an audit, (ej, hi"), is deter- 
mined by the firm's liability-sharing rule, 8, and by the client's type, t. First, observe that 
an auditor has no incentive to help his partner if he bears no responsibility for his partner's 
audit failure (i.e., h” = 0 if B; = 1), and the auditor is induced to provide a positive amount 
of cooperation/helping effort if he bears some responsibility for his partner's audit failure 
Ge, h > 0 if 0 < Bi < 1). 

Next, observe that the same lHability-sharing rule induces different effort profiles for 
different types of clients. More specifically, given any liability-sharing rule B € (0, 1), the 
equilibrium effort level changes with client type, with the direction of the change dependent 
on two opposing forces. On the one hand, based on the Equilibrium Conditions (12) and 
(13), Assumption 3 implies that when the client type increases from C to S, the direct 
effects are that audit effort e (cooperation effort h) increases (decreases) for a given level 
of the other partner's effort. On the other hand, Assumption 5 implies that partners' effort 
inputs are strategic complements in the sense that the increase (decrease) of one partner's 
effort raises (reduces) the other partner's incentive to increase effort. Therefore, the overall 
effect of the increase of client type on effort depends on which of the two forces 
dominates.'* 


Stage 1’s Problem: Pricing Strategies 

The subgame in Stage 1 consists of two independent, simultaneous games played in 
two locations, and in each location, auditors from different firms compete in bidding for 
clients. Given symmetry, we focus on auditors’ bidding/pricing strategies in just one lo- 
cation, say, Location i € (1, 2}. We first characterize the competing auditors’ payoffs from 
an audit engagement. We let k‘; represent Auditor ri's personal cost of the audit, and K‘, 
represent the firm's cost (i.e., the cost to the firm's partners as a whole), where r € (A, 
B) and ¢ € (C, S). Based on Proposition 1, we have: 


Ky = KB) = BA — o)[1 MEB), A7 (951 + e" (B5), 
K; = K(Br) = à - $1 — uC"(Br), ABD) + e" (Bro + ^" (B5. 


Hence, Auditor ri gains a personal payoff of (et, ft, — K4) if he wins the audit en- 
gagement, whereas the firm's payoff/economic profit from the audit is (f4 — K1) (see 
footnote 6 for the definition of the firm's economic profit). Note that the personal payoff 
may differ from the firm's payoff because, when the personal payoff is calculated, the self- 
interested auditor discounts the audit fee revenue and the expected legal liability costs 


14 In particular, it can be shown by Cramer's rule and Assumptions 3—5 that, given any B € (0,1), the equilibrium 
audit effort e* (cooperation effort h*) strictly increases (decreases) as client type changes from C to S, when 
the direct effect dominates, i.e, when L, > (yf abe, (when —p,, > (— Herl penu respectively) for all x 
€ [C, S]. 
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associated with the audit by a and B, respectively, and completely ignores his partner's 
cost of cooperation effort, h. The incentive problem caused by the discrepancy between the 
auditor's personal payoff and the firm's payoff will soon become clear (see discussion 
following Proposition 2). 

We now analyze the auditors' bidding strategies. Recall from the analysis in Section II 
that shareholders will choose the auditor who provides the higher shareholder surplus, as 
stated in Expression (8), which equals the gross benefit of the audit, denoted by v', 
= v'(Bi = (1 — 6)uiI + (1 — X1 — p), less the audit fee, f‘,. Since the gross benefit 
of an audit is fixed given the firm's liability-sharing rule, an auditor's choice variable in 
this stage is the audit fee. In bidding for clients, each auditor's objective is to choose an 
audit fee so as to provide just a slightly higher shareholder surplus than does his competitor, 
while not allowing the audit fee to fall below his break-even price, which is denoted by 
f, and fr, = k1,/a/, by definition. We let o7, be the maximum amount of shareholder surplus 
Auditor ri can provide to shareholders, which is the difference between the gross benefit 
of the audit and the auditor’s break-even price: o/, = v', — f}. Note that we restrict revenue- 
sharing rules to positive values (i.e., a > 0), because a firm chooses a non-positive o only 
if it wants to discourage its partners from bidding for clients, but the firm can always use 
a sufficiently small positive « such that the maximum shareholder surplus is negative (i.e., 
c < 0) to achieve that objective. 

As in Grossman (1981) and Hurter and Lederer (1986), we impose the feasibility 
restriction that an auditor will not bid below his break-even price. In addition, to ensure 
the existence of equilibrium, we make the following tie-breaking assumption about share- 
holders' preferences: 


Assumption 7: If auditors provide the same positive shareholder surplus, then share- 
holders choose the auditor for whom the maximum shareholder surplus 
is higher, and they choose auditors randomly with equal chance if the 
maximum shareholder surplus is the same for all auditors. 


Proposition 2: In Location i € (1, 2], given the firms' sharing rules characterized by 
(a5 B4, ai, Br). t € (C, S, q # r, q, r € (A, B}, for each client, 
if the maximum shareholder surplus, c, is negative for both auditors 
(i.e., if of, < 0 and o} < 0), then both auditors have zero demand. If 
o4 < 0 and c7, = 0, then the auditor from Firm q has zero demand, 
and the auditor from Firm r wins the bid with the monopoly price equal 
to the gross benefit of the audit, v‘, and gains a non-negative personal 
payoff (a positive payoff if o, > 0). If 07, = o7, = 0, then both auditors 
bid their break-even prices and win the bid with equal chance. If 0 
< of, < a}, then the auditor from Firm q bids his break-even price, 
fj, and the auditor from Firm r wins the bid with the price equal to 
(v^, — of,) and gains a positive personal payoff. 


The bidding/pricing equilibrium characterized by Proposition 2 is intuitive: given that 
shareholders choose the auditor who provides the higher (positive) shareholder surplus, the 
auditor for whom the maximum shareholder surplus is higher has a competitive advantage, 
in the sense that the auditor can always offer a higher shareholder surplus (relative to his 
competitor) with an audit fee greater than his break-even price. However, as noted earlier, 
the auditor's personal payoff may differ from the firm's payoff for an audit engagement. 
Consequently, the auditor's equilibrium pricing strategy may not be optimal from the part- 
nership's point of view. In particular, an auditor has an incentive to bid below the firm's 
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cost when his break-even price, f, falls below the firm's cost, K. To illustrate, consider a 
case where 0 < o4, < o^. In this case, Auditor Al wins the bid and gains a positive 
payoff with the equilibrium price f 7, = v^, — o5, > f^ (by Proposition 2), but Firm A 
as a whole suffers a loss when the auditor's break-even price is less than the firm's cost, 
i.e., when f^, < f%, < K4,. Conversely, it is easy to see that an auditor has an incentive 
to forgo a bid profitable to the firm when his break-even price exceeds the firm's cost, i.e., 
when f > K. | 


Stage 0’s Problem: Profit-Sharing Rules 

At the contracting stage, observe that within each audit firm both partners prefer the 
sharing rules that maximize the sum of their payoffs, which is just the firm's payoff, namely 
the firm's economic profit. Thus, the game in Stage 0 can be viewed as a simultaneous 
game between the two firms, with each firm choosing its sharing rules to maximize its 
payoff against the compensation strategy of the rival firm. To focus on the incentive effects 
of the sharing rules, we will ignore the determination of m, namely the predetermined lump 
sum transfer between the partners of a firm, because the transfer does not affect the partners' 
effort or pricing decisions (i.e., it has no incentive effect), and hence does not affect the 
sum of the partners’ payoffs.!5 

Before proceeding with a firm's problem of finding the optimal incentive weights, a 
and B, it is useful to comment on the partners’ participation constraints. Recall that a partner 
will agree to play the game if his (expected) payoff given the compensation contracts is 
non-negative. It is easy to see that if the firm's economic profit (i.e., the sum of the partners’ 
payoffs) is non-negative, then there always exists a lump sum transfer, n, between the firm's 
partners so that each partner gets at least zero payoff. Therefore, a sufficient condition for 
the partners’ participation constraints to be satisfied is that the firm’s economic profit is 
non-negative. 

We first consider the firm’s choice of revenue-sharing rules, a. Note that a partner’s 
break-even price is ft, = x'(B‘,)/a‘,. Thus, to each value of B‘,, there corresponds a revenue- 
sharing rule, o, = «‘(B',)/K‘(B‘,), such that the partner’s break-even price coincides with 
the firm's cost (namely f‘, = K‘,), and the partner's pricing strategy therefore also maxi- 
mizes the firm’s payoff. In this sense, such a revenue-sharing rule is optimal for the firm 
conditional on the given B. 

We now consider the firm’s choice of liability-sharing rules, B. Natural candidates are 
those that induce partners to perform audits efficiently. This leads to the following 
definition: 


Definition 1: For a client of type t € (C, S), define B’° as the liability-sharing rule 
that maximizes the audit’s value, denoted by Q(B‘, D, which is measured 
as the sum of the shareholder surplus from purchasing the audit and the 
audit firm's economic profit from producing the audit. That is, B^? solves 
the following maximization problem: 


max Q(B' D = WB) — f) + (f — KB) 


P'ero.gi 
= v'(B^ — K'(B^ 
= (1 — $)ue"(B5, h"(B))] — e"(g^ — h"(B5, (14) 


!5 A bargaining game between the firm's partners will be called for to determine the lump sum transfer, n, which 
is not the focus of this paper. 
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where B is such that e"(B) = e and &"(B) = O for any B = B and t 
€ (C, S}. 


Note in Equation (14) that the value of an audit, Q(B‘, p), is equal to the expected 
savings due to the audit of wasteful investments in a bad company, minus the partners' 
joint effort costs in performing the audit. An efficient audit is one whose value is maximized 
by the liability-sharing rule B'?^ through the induced effort profile, and the value of an 
efficient audit equals O(e"(B'%, 5"(g'?); À. 

We assume enough regularity such that B'^ is unique. In addition, assume that B° 
> 0, which implies that the value of an efficient audit is positive, thus ensuring the existence 
of an audit market. Next, we assume that an efficient audit requires teamwork (namely h 
> 0). Based on Proposition 1, this assumption implies that B‘° < 1.'° That is, it is optimal 
for partners to share the legal liability costs of an audit in order to induce the partner 
collaboration required for the efficient performance of the audit. Finally, assume that the 
difference between the two types of clients—as characterized by the parameterized audit 
quality function j(e, h; t)—is nontrivial such that B^ + BC. This implies that the effort 
profile, (e, h), required to provide an efficient audit varies according to client type. In 
particular, using Topkis’ monotonicity theorem and Equation (14), we observe that g^ 
< BY when the following condition holds: 


aO(B, C) _ MB, S) 
ap op 


where 0((B,D/0B = [(1 — puil — leg + [I — pal — I]hg, t € (C, S), and the 
subscripts e, h, and B denote derivatives. Condition (15) states that the marginal benefit of 
B is greater for an S type client than for a C type client. Thus, when Condition (15) holds, 
a higher client type makes a larger B desirable; that is, the efficient audit of an $ type client 
requires that greater legal liability costs be imposed on the engagement partner, compared 
with the efficient audit of a C type client. 

The next proposition establishes the firms' equilibrium compensation strategies. 


for all B € (0, 1), (15) 


Proposition 3: Firms have the same dominant compensation strategy: For clients of 
type t € (C, S], the sharing rules are (a^, B^) = (a°, B*°), where r 
€ (A, B}, i € (1,2), a? = «(g'/K'(p'^, and <! and K: are the 
engagement partner's personal cost and the firm's cost of an audit, 
respectively. In the dominant strategy equilibrium, firms share clients 
of each type and earn zero economic profit. 


The equilibrium has the two firms setting the same compensation strategy, composed 
of client-type-specific sharing rules. The intuition for the equilibrium is as follows. The 
equilibrium revenue-sharing rule equates the partner's break-even price to the firm's cost 
for each type of client (1.e., ft! = K: when a'? = «‘/K‘). As such, the revenue-sharing rule 
induces a partner to choose the audit fee to maximize the firm's payoff. Furthermore, under 
such a revenue-sharing rule, the maximum shareholder surplus for the partner equals the 
value of the audit (since o! = u! — f! = u! — K: = (Y when f' = K^; therefore, by 
maximizing the audit's value, the equilibrium liability-sharing rule, B!°, maximizes not only 


‘6 Recall that Proposition 1 shows that an auditor is induced to provide cooperation to his partner if and only if 
he bears some responsibility for the legal liability costs of his partner’s clients. 
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the partner's incentive to win an audit engagement, but also the firm's payoff from the 
audit. To see this, note from Proposition 2 that the firm's payoff from an audit it performs 
equals the audit's value less the rival auditor's maximum shareholder surplus (i.e., Firm r’s 
payoff is (v; — of) - Ki = OF — epr, q € (A, BJ). 

It is interesting to note that B‘° € (0, 1) implies that a'? € (0, 1). That is, given that 
it is optimal for partners to share the legal liability cost of an audit in order to induce 
teamwork for the audit, it is also optimal for partners to share the audit fee revenue of the 
audit. To explain revenue sharing, first note that f' = <! < K! when a‘ = 1, which means 
that a partner tends to bid below the firm's cost (1.e., ff < K^ when he gets the entire fee 
revenue from his clients (i.e, when a‘ = 1). Next, note that ft = «'/a! = K! when a! 
= k'/K' < 1, which implies that sharing revenues (i.e., =! < 1) has an effect of imposing 
“taxes” on engagement partners in order to counteract the individual partner's inappropriate 
aggressiveness with regard to his pricing policy, thereby aligning the partner's break-even 
price, f‘, with the firm's cost, K“. 

We conclude this section by noting that the equilibrium is a perfect competition equi- 
librium: the same compensation schemes by the firms induce the same effort and pricing 
strategies. As a result, for each client, auditors from different firms bid the same price 
(which equals the firm's cost), have an equal chance to win the bid, and earn zero economic 
profit. In equilibrium, the two firms have the same market share in each location, namely 
(nt + nf)/2, i € (1, 2). Shareholders benefit the most from the perfect competition in that 
they collect the entire value of the audits. 


IV. CLIENT TYPE NOT CONTRACTABLE 

The perfect competition result in the previous section hinges on the assumption that 
client type is contractable, which as discussed in Section II, we believe, is implausible in 
the real world. In this section we examine the case in which client type is not contractable. 
Specifically, we assume that a client's type is observable to the parties who are actively 
involved in the audit (including representatives of the shareholders who hire auditors, the 
engagement partner, and the partner who provides assistance in the performance of the 
audit), but cannot be verified by an enforcement party such as the head office of the audit 
firm or a court, and thus cannot be used for partner compensation purposes." Accordingly, 
the firm's Compensation Contract (9) cannot vary with client type and becomes: 


y=a, 2, Rit+d-a) 2 Ri-B > Li-ü-8) 2 L: + mp 
t€ (C,S) t€: (C,S) z€ |C,S) z€ |C,S) 
(16) 
where i +j, i, j € (1, 2}. 


Analysis and Results 


Since the effort choice and the bidding subgames are the same as before, the effort and 
pricing strategies—as functions of the profit-sharing rules—are the same as characterized 
in Propositions 1 and 2. The critical difference between this and the previous case lies in 


17 Partners must be able to verify the client's type in order to implement a type-dependent sharing rule. Although 
the client's type is observable to the engagement partner and the helping partner, both of whom are actively 
involved in the audit and hence have learned the client's type in the course of doing business, they have incentives 
to misreport the client type ex post (e.g., a helping partner tends to claim a simple client as a complex client to 
seek more compensation for his cooperation effort). Verifiability implies that an independent third party, such 
as a court or other partners in a multiple-partner firm, can easily settle compensation-related disputes. 
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the contracting stage. In the previous case, a firm can use client-type-specific sharing rules 
to induce optimal partner behavior for all types of clients; but when client type is not 
contractable, the firm cannot find uniform sharing rules that induce the same optimal be- 
havior. To see this, note that a liability-sharing rule, B, does not exist that induces efficient 
audits for all clients because B + B”, Despite this apparent contracting disadvantage, 
careful examination of interactions between firms reveals that the firms may differentiate 
themselves by specializing in different types of clients, thereby gaining limited monopoly 
power over the clients in which they specialize. Proposition 4 identifies conditions under 
which there exists a specializing equilibrium defined as follows: 


Definition 2: In Location i € (1, 2), a specializing equilibrium is characterized by the 
firms’ liability-sharing rules (B,, B,) = (B?, BC), q * r,q, r € (A, B). 
That 1s, in equilibrium, firms specialize in different types of clients by 
choosing different value-maximizing, liability-sharing rules. The type of 
clients in which a firm specializes constitutes the firm's niche market. 


Proposition 4: There exists a specializing equilibrium where By, = B4 = B”, Ba: 
= B5 = B^, a4, = aj = w, a5, = ap = a®, i € (1, 2}, if the 
following conditions hold: 


S/Q OSK Cft, Co 

X8?) — KB > vie - SE. (17) 
CLR SYR Sr n Se 

DABS) - KIBA) > a qe -Ea (18) 


In equilibrium, each firm audits only clients from its niche market 
and earns positive economic profits as follows: 


[L = (in(v$, vf — o3) - K$S5Gš$ + n, (19) 
IL = Guin(v$*, vS — of} — KgYnf + nO), (20) 


where II denotes the firm's economic profit, and n! is the number of 
clients of type t € (C, S) in Location i € (1, 2]. 


Panel A of Figure 3 depicts the equilibrium established in Proposition 4, where the 
firms specialize in different types of clients: Firm A's market share is n? + nš and Firm 
B's market share is n€ + nš. In equilibrium, Firm A (Firm B) chooses the liability-sharing 
rule that maximizes the value of audits for the S type (C type) clients, so that auditors from 
Firm A (Firm B) become specialists for the S type (C type) clients. Note that an auditor 
acting as a specialist for a type of client does not mean that he has specialized skills at 
performing the audit, but rather that—because of his firm's compensation schemes-—the 
auditor and his partner are induced to provide a more efficient audit (i.e., a higher value 
audit) for that type of client than would the auditors from the rival firm. 

We now explain the sufficient conditions, Inequalities (17) and (18), under which a 
specializing equilibrium exists. Note that the specializing liability-sharing rules (that is, 
efficient audits) alone do not ensure that auditors win the clients in which they specialize. 
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FIGURE 3 
Specializing Equilibria 
Panel À: Partners within a Firm are Compensated in the Same Way 


Partner Al ——————9 S Type Partner A2 — — P S Type 


Partner BI ——— C Type Partner B ————— C Type 





Location 1 Location 2 
For the numerical example given in the text, in this case, af, = af, = 0.9994, BF = BE = 0.998, 
at = a£, = 0.8807, B# = BE = 0.8677. Fix nj = 10 where i € (1, 2) and t € (C, S}; the total 
shareholder surplus is 17.13 and Firm A's (Firm B's) economic profits are 1.584 (0.068). 
Panel B: Partners within a Firm are Compensated in the Same Way 


Partner A1 Partner A2 


Partner B1 Partner B2 





Location 1 Location 2 
For the numerical example given in the text, in this case, af, = až, = 0.8807, B%, = BZ 
= 0.8677, af, = a£, = 0.9994, Bf, = B£, = 0.998. Fix nj = 10 the total shareholder surplus 
is 17.13 and Firm A's (Firm B's) economic profits are 0.068 (1.584). 
Panel C: Partners within a Firm are Compensated in Different Ways 


Partner Al ——— h S Type Partner A2 


Partner B] ————— C Type Partner B2 





Location 1 Location 2 


For the numerical example given in the text, in this case, af, = af, = 0.9994, p#, = BE, = 0.998, 
af, = af, = 0.8807, BR, = BY, = 0.8677. Fix n; = 10 the total shareholder surplus is 17.13 and 
Firm A’s (Firm B's) economic profits are 0.826 (0.826). 


Panel D: Partners within a Firm are Compensated in Different Ways 
Partner Al Partner A2 — — —» S Type 


Partner B1 Partner B2 —————® C Type 





Location 1 Location 2 


For the numerical example given in the text, in this case, af, = o£, = 0.8807, Bt, = BE 
= 0.8677, af, = at, = 0.9994, Bf, = Bt, = 0.998. Fix n; = 10; the total shareholder surplus 
is 17.13 and Firm A's (Firm B's) economic profits are 0.826 (0.826). 
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We know from Proposition 2 that the bidding outcome depends on the maximum share- 
holder surplus, which is based not only on the induced effort profile, but also on the induced 
pricing policy. More specifically, although the revenue-sharing rule of Firm A induces the 
optimal pricing strategy for the S type clients, it cannot do so for the C type clients at the 
same time since a = «5(g5)/K5(B5») + «“(B*)/K“(B”) (which implies that the break- 
even price differs from the firm's cost for the C type clients, i.e., fç + KC). Thus, Firm A 
must be concerned that its partners would attract the C type clients with prices below the 
firm's cost. This situation would be prevented by Condition (18), which states that the 
maximum shareholder surplus of Firm A for a C type client (the right-hand side) is less 
than the maximum shareholder surplus of Firm B for a C type client (the left-hand side, 
which equals the value of an efficient audit), i.e., o£” < c S* (see the Appendix for details). 
In other words, Firm A's sharing rules are such that its partners—who are nonspecialists 
for a C type client—would not be able to provide a shareholder surplus greater than the 
value of an efficient audit provided by Firm B for a C type client. It is easy to show that 
if Condition (18) does not hold, then Firm A’s partners would bid below the firm's cost for 
a C type client; that is, if o£ = of, then f£. < KÇ. It follows that Condition (18) holds 
when Firm A's partners are induced not to bid below the firm's cost for Firm B's clientele. 
A similar analysis applies to Firm B and Condition (17). In sum, these conditions impose 
restrictions on the parameters of the model such that partners are induced not to bid far 
below their firm's cost for the rival firm's clientele.? It should be noted that Conditions 
(17) and (18) are expressed in terms of B'^, which itself is a function of the primitive 
parameters of the model (see Definition 1). While the complexity of the multistage problem 
prohibits expressing the sufficient conditions in terms of the primitive parameters of the 
model, we later use a numerical example to illustrate cases in which these conditions hold. 

We observe that neither firm has an incentive to deviate from the equilibrium in which 
the firms obtain positive payoffs (namely economic profits) from specialization. To explain 
this, suppose that Firm A wants to appropriate Firm B's payoff. Firm A must first change 
its liability-sharing rule so as to provide higher value audits for Firm B's clientele. Since 
Firm B has been efficient (and its partners are induced to bid in the firm's interest) with 
respect to its clientele, Firm À cannot earn any economic profit from the rival firm's niche 
market. However, changing its liability-sharing rule will cause Firm A to provide lower 
value audits for its own niche market and to earn less economic profit from each of its 
clients, which equals the decreased value of the audit less the rival auditor's maximum 
shareholder surplus (if o3 = o or equals 0 (if o3 < o5). A similar analysis applies to 
Firm B. As a result, positive equilibrium payoffs will not be bid away in competition. 

Although the total value of audits in the specializing equilibrium is the same as in the 
perfect competition equilibrium analyzed in Section III, the audit's value with specialization 
is divided between the audit firms and the shareholders; in contrast, all the value of audits 
accrues to the shareholders under perfect competition. 


Example 

To help understand the equilibrium results, consider the following example. Suppose 
the audit quality function is un(e, h; x) = xVe + (1 — x) Veh, where e € [0, 1], h € [0, 
1], x € (0, 1). In addition, take C = 0.5, $ = 0.95, b = 0.7, L = I = 6. Thus, we have 
we, h) = 0.5Ve + 0.5Veh and pe, h) = 0.95 Ve + 0.05Veh. 


18 That partners are induced not to bid below the firm's cost for the rival firm's clientele is a more stringent 
condition than Conditions (17) and (18). In fact, Conditions (17) and (18) hold even when partners bid below 
the firm's cost for the rival firm's clientele (e.g., f£ < K£), as long as the bid is not far below the firm's cost 
so that the resulting shareholder surplus does not exceed the value of an efficient audit provided by the rival 
firm. 
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It can be shown that when client type is contractable, the dominant strategy equilibrium 
composed of client-type-specific sharing rules is as follows: (a5, = 0.9994, B+" = 0.998, 
aS” = 0.8807, BE” = 0.8677} for all r € (A, B} and i € (1, 2}. In contrast, when client 
type is not contractable, there exists a specializing equilibrium as follows (see Panel A of 
Figure 3): {a,, = 0.9994, Bi, = 0.998; aj, = 0.8807, B>, = 0.8677}, and the induced 
effort choices are (e$; = 0.7281, h$; = 0.5898 x 1075; eG" = 0.1598, hz; = 0.566 x 107?) 
for all i € (1, 2}. 

As shown in Panel A of Figure 3, in the specializing equilibrium, Firm À (Firm B) 
audits only the simple (complex) clients. Observe that aj, > a and Bi, > Ba, i € (1, 
2), which means that Firm À, which specializes in the simple clients, bases a partner's 
compensation more on his local office performance—as reflected by the revenues and lia- 
bilities associated with the partner's own audit engagements—than does Firm B, which 
specializes in the complex clients. We first explain why B; > B}. Note that Condition (15) 
is satisfied, which as indicated earlier, implies that a larger (smaller) B is desirable for the 
efficient audit of a simple (complex) client. The factors that work together for Condition 
(15) to hold are rather complicated. We here offer a discussion at the most intuitive level 
by focusing only on the dominant effects. As implied by Assumption 3, the marginal benefit 
of cooperation effort h (engagement audit effort e) is greater for the complex client (for 
the simple client). Intuitively, this suggests that the efficient audit of a complex client 
requires more cooperation/helping effort than that of a simple client. Consistent with this 
intuition, Firm B imposes a larger share of the legal liability costs on helping partners 
(namely a smaller B) to induce greater helping effort, compared with Firm A (ie. hS* 
> hš". 

We next explain why a, > a;. It is straightforward to verify that «2 /K3. > «S /K$, 
which means that there is a relatively larger discrepancy between the engagement partner's 
personal cost and the firm's cost of an audit in Firm B than in Firm A with respect to their 
clientele; therefore, a partner of Firm B would be relatively more aggressive in the pricing 
policy if he were fully compensated for each dollar of fee revenues he brought into the 
firm (i.e., if a = 1). It follows that Firm B imposes a higher “tax rate" on engagement 
partners (namely a smaller «) to induce the optimal pricing behavior for complex clients 
than Firm A does for simple clients. In essence, this example illustrates that, when the 
client's type is not contractable, each firm uses type-oriented sharing rules to induce the 
partner behavior (in terms of the partner collaboration and pricing policy) most suitable for 
the type of clients in which the firm specializes. | 

Given that p = 0.7 and I = I = 6, while we cannot check the sufficient Conditions 
(17) and (18) for all the possible values of (C, S), where 0 < C < S$ < 1, we can identify 
a large set of values of (C, S) for which the sufficient conditions hold and thus a special- 
izing equilibrium exists. Examples include the following: (1) Fix S = 0.95; the sufficient 
conditions hold for all C € (0.14, 0.15,...,0.73, 0.74}. (i) Fix S = 0.99 the sufficient 
conditions hold for all C € (0.03, 0.04,...,0.9, 0.91}. (iii) The sufficient conditions hold for 
all (C, © € {(C, SNC € (0.26, 0.27,...,0.59, 0.60), S € (0.92, 0.93,...,0.98, 0.99}}.!9 


Other Issues 


In this subsection we discuss two technical issues and limitations related to our equi- 
librium results: the multiplicity of specializing equilibria, and the observability of compen- 
sation contracts. First, we note that, under the conditions specified in Proposition 4, there 


I? Our study of many hundreds of numerical examples suggests that the sufficient conditions are more likely to 
hold when the client types, C and S, are far apart. 
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exist four specializing equilibria as shown in Figure 3. In addition to the two equilibria in 
which partners within a firm are compensated in the same way—as depicted in Panels A 
and B of Figure 3—the other two equilibria are such that partners within a firm are com- 
pensated differently so that each office of the firm specializes in a different type of client, 
as depicted in Panels C and D of Figure 3. For example, the following is also an equilibrium 
(see Panel C): 


{aji za OB =a, Bi > Bm = p Q gi = Q4) = aC, Ba = Biz = Bo}. 


That is, in Location 1, Partner A1 (Partner B1) specializes in S type (C type) clients, 
whereas in Location 2, Partner A2 (Partner B2) specializes in C type (S type) clients. 

It is straightforward to see that the economic profit earned from each client is the same 
across the four specializing equilibria, regardless of which firm earns the profit. Therefore, 
the sum of the two firms' economic profits is the same across the equilibria. To illustrate, 
we return to the preceding numerical example. It can be shown that in the dominant equi- 
librium analyzed in Section III, for nj = 10 where i € (1, 2) and t € (C, S}, the total 
shareholder surplus is 18.782 and each audit firm earns zero economic profit. By compar- 
ison, when client type is not contractable, for ni = 10, Figure 3 demonstrates the total 
shareholder surplus and the firms’ economic profits for each of the four specializing equi- 
libria; clearly, moving from one to another equilibrium only makes one firm better off and 
the other worse off, with the total of the firms' payoffs unchanged. It can be shown that 
these are undominated equilibria (i.e., firms play undominated strategies in equilibrium).” 
In this case, game theory does not offer a general theory about which of the equilibria will 
be obtained. 

In these specializing equilibria, firms achieve monopoly positions for the clients in 
which they specialize. However, it should be noted that in the real world, there may exist 
mismatches in the sense that a small proportion of an audit firm's clients may come from 
a rival firm's niche market. Admittedly, our model does not consider all the factors that 
affect a client's auditor choice. For example, client companies may desire audit firms not 
associated with their rival companies for fear of revealing proprietary information to their 
competitors (Kwon 1996). In addition, client companies may not want to hire their exec- 
utives' former CPA firms as auditors for fear that audit quality will be impaired (Lennox 
and Park 2003). Moreover, not all companies' auditor choices are determined by indepen- 
dent, competent audit committees (i.e., the auditor choices are not always optimal), which 
can also cause real-world mismatches. 

The second issue we consider is observability of contracts. A crucial assumption un- 
derlying the specializing equilibria is that firms’ compensation plans are observable to all 
game players. This may occur, for example, by learning through repeated play of the game. 
Although this assumption is commonplace in the economics literature on strategic delega- 
tion (e.g., the Fershtman and Judd [1987] and Sklivas [1987] type models), it is often 
criticized on the grounds that the contracts may not be renegotiation-proof (see Katz 1991). 
That is, the equilibrium may not hold if a firm secretly changes its own contracts after all 
firms have publicly announced their contracts. Unfortunately, we cannot solve for an equi- 
librium in which client type is not contractable and compensation contracts are not publicly 
observable.” In this regard, we note that our study differs from Kockesen and Ok (2004), 


20 The proof that these equilibria are undominated is essentially the same as the proof of Liu (2002, Corollary 3.1) 
and is thus omitted. 

21 However, it can be shown that the specializing equilibria that exist under the conditions specified in Proposition 
4 cease to hold when compensation contracts are unobservable. 
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who find that, when the decision of delegating or not delegating is publicly observable, 
there may exist a delegation equilibrium in which unobservable compensation contracts can 
play a strategic role. While the delegation of a job by a principal to an agent is an option 
in their study, decentralization is a must in our study since the "principal" here is not a 
person, but rather represents the collective interest of a partnership in which individual 
partners make decentralized effort decisions. 


V. CONCLUDING REMARKS 

In this paper, we examine the effects of compensation arrangements on the behavior 
of partners within a public accounting firm, as well as on the strategic interactions of the 
firms in an auditing oligopoly. We use a stylized model where (among other assumptions) 
there are two audit firms and two types of clients. We show that when partner profit-sharing 
rules can vary with client type (i.e., client type is contractable), a perfect competition 
equilibrium will obtain. However, when the client's type is not contractable—which we 
believe to be a reasonable description of the real world—firms may strategically differen- 
tiate themselves by choosing different profit-sharing rules in order to specialize in different 
types of clients. By doing so, the firms avoid cutthroat competition and earn positive eco- 
nomic profits from their niche markets. Our equilibrium results generalize naturally to the 
situation in which there are m audit firms and m different types of clients (where m > 2). 
The idea can also be applied to the analysis of the scenario where there are many client 
types in an auditing oligopoly. 

The theory developed in this paper offers an explanation for the observed differences 
in the compensation plans of the big public accounting firms; namely, there exist both 
"small profit pool" and “large profit pool" firms. In the former, local office profitability is 
very important to the compensation of a partner, while local office profitability is not 
important in the latter case. An implication of the situation where client type is not con- 
tractable and firms adopt different compensation arrangements in a specializing equilibrium 
is that the client portfolios of firms will vary. In the year 2000, we conducted structured 
interviews with partners from the Vancouver, Canada offices of KPMG and (former) Arthur 
Andersen & Co. to investigate the compensation arrangements of those two big firms. Based 
on these interviews, we can classify KPMG (Arthur Andersen) as a “small pool" (“large 
pool") firm in the sense of Trompeter (1988). Thus, KPMG’s sharing rules are more tailored 
to the audits of simple companies, which do not require much partner cooperation, whereas 
Arthur Andersen’s sharing rules are more suitable for the audits of relatively large complex 
companies, whose audits require more partner cooperation. These partner compensation 
arrangements can, for example, explain the U.S. Senate survey results (U.S. Senate 1977, 
30—31, 421) regarding the client portfolios of the then Big 8 firms. In that survey, Arthur 
Andersen (Peat Marwick International, now KPMG) reported that, as of 1975, it had 417 
(331) audit clients that were corporations listed on the NYSE or AMSE, in contrast to 6,000 
(23,245) audit clients that were either not listed on the major stock exchanges or were 
privately held corporations. This evidence suggests that, consistent with the implications of 
their compensation plans, Arthur Andersen was relatively specialized in the large/complex 
company sector, and Peat Marwick International (now KPMG) was relatively specialized 
in the small/simple company sector?" We also note that 19 years later, O’Keefe et al. 
(1994) found that, in an industry-stratified random sample of U.S. audit clients of KPMG, 
fully 80 percent were closely held companies. 


7 Similar observations are made by Liu (2002) based on Who Audits America. 
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To conclude, while the above evidence is suggestive and consistent with the theory 
developed in this paper, future empirical studies would be useful to examine the details of 
the actual profit-sharing arrangements among the partners in each of the now Big 4 firms. 
Differences in compensation arrangements can then be studied to determine their impact 
on audit firm decisions and firm characteristics (through the effects on individual partners' 
behavior), such as their client portfolios and audit quality. Such studies will help us gain a 
greater understanding of the functioning of audit firms in the economy. 


APPENDIX 
See Table 1 for notation definitions. 


Proof of Proposition 1 

In the audit firm, consider the case Bi € (0, 1), i € (1, 2). It is easy to see from 
Assumptions 1, 2, 4, and 6 that, for any of Partner Ps clients of type t € (C, S), the reaction 
curves (i.e., partners’ effort choice functions), e(h) and A(e), are determined by (A1) and 
(A2), respectively: 


Bal — d$)uE(e, Al — 1 = 0, (A1) 
(1 — BDU — doni(e, hl — 1 < 0, with equality if e > 0. (A2) 


We first prove the existence of equilibrium. Denote by Á the level of effort by the 
helping partner given that the engagement partner's effort level attains the upper bound, 
ie. A = h (e), which is less than h by Assumption 6. Let h^!(*) represent the inverse 
function of h(-). Note that e(h) and h (h) are continuous and strictly increasing on [0, A] 
by Assumptions 4 and 5. Consider the function ¿£ defined by ¿(h) = e(h) — h^ Wh). t(0) 
> 0 because e(0) > 0 and A^ (0) = 0. On the other hand, z(f) < 0 because e(f) < e and 
h^ (h) = e. Since £ is continuous on [0, A], according to the intermediate value theorem 
there exists kh’ € (0, A) such that z(h') = 0, namely e(h’) = h^ (h'). Thus, the reaction 
curves intersect in (0, A). This proves the existence of equilibrium, and that e” € (0, e) 
and hi € (0, h), where i # j, i, j € (1, 2). 

We now prove the uniqueness of the equilibrium. A sufficient condition for the solution 
to be unique (i.e., for the reaction curves to intersect only once) is that the reaction curve 
of the engagement partner is flatter than that of the helping partner at equilibrium. This 
condition is satisfied given Assumptions 4 and 5. Assumption 4 states that p, p, — Heh 
Wen > 0. Thus, we know from Assumption 5 that —p,,/m.. < — Harl kens which then implies 
de(h)/ah < ah" (h)/ah. 

Therefore, when B; € (0, 1), there exists a unique equilibrium (ej, hr) jointly deter- 
mined by Equations (12) and (13). The proof of the equilibrium in other cases (1.e., B: 
= 0 and Bp: = 1) is straightforward and hence omitted. 


Proof of Proposition 2 

Consider the local competition for a client of type t € (C, S) in Location i € (1, 2). 
Shareholders will not purchase an audit if they get a negative surplus. Therefore, if o7, 
< 0 and gf < 0, where q # r, q, r € (A, B}, then both auditors have zero demand. If 
of, < 0 and of, = 0, then Auditor qi has zero demand, and Auditor ri becomes a monopolist, 
who charges the price of v', to provide just zero shareholder surplus. In this monopoly 
case, Auditor ri's payoff is a/,v', — k'; = 0, where the inequality (equality) follows from 
gt = ut, — khai = 0. 
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If of, = oh = 0, then it is easy to verify that the only equilibrium is for both auditors 
to bid their break-even prices. Since shareholders are indifferent between the two bids, it 
follows from Assumption 7 that the auditors win the bid with equal chance. 

If 0 = of, < of, then we propose that the strategy profile (fo f) = (fon v — op 
is a unique (pure strategy) equilibrium. Note that in equilibrium, shareholders are indifferent 
between the two bids. Thus, Auditor ri wins the bid by Assumption 7. In this case, Auditor 
ri’s payoff is awh — 02) — kh > 0, where the inequality follows from a7, = vh — «}/ 
at, > of, We now show that the above strategy profile is an equilibrium by showing that 
an auditor cannot gain from a unilateral deviation from the equilibrium. Given Auditor ri's 
equilibrium strategy, Auditor qi would still have zero demand if he increased the bid, and 
would suffer a loss if he decreased the bid. Given Auditor qi's equilibrium strategy, Auditor 
ri would lose the profitable bid if he increased the bid, and would leave money on the table 
if he decreased the bid. Furthermore, the (pure strategy) equilibrium is unique given that 
an auditor would not bid below his break-even price. To see this, suppose that Auditor qi 
bids above his break-even price: f7, > f4 This cannot be an equilibrium strategy: Auditor 
ri’s best response is to bid a price at v^, — (v^, — ft), but Auditor qi could then increase 
his payoff by choosing a price between his break-even price and f% 


Proof of Proposition 3 

Due to limited space, we only lay out the main steps of the proof. Interested readers 
can refer to Liu (2002) for the details. 

For clients of type t € (C, S), consider Firm A's problem with respect to Partner Al. 
The proof that (x'(B'2/K'(B'^, B'?) is the dominant strategy for Firm A with respect to 
Partner Al takes the following three steps: 

The first step is to show that (a(B4,), B4) = («(B4)/K(B4,), B4) dominates (o5, 
BÀ, for all a4, * a(B4,) and B4; € {Blv'(B) — KB) > 0}. 

The second step is to show that (x'(B'^)/K'(B'?), B‘°) dominates (a'(B4,), B4,) for all 
Bi, € {BIB # B? and v'(B) -K*XB) > 0}. 

The third step is to show that (x'(B'^?)/K'(B'^), B‘°) dominates (a4, B4,) for all a4, 
and B4, € {Blv'(B) — K'(B) = 0). 

Similarly, we can show that (x'(B'?)/K'(B'? 8'?) is the dominant strategy for Firm A 
with respect to Partner A2. The proof for Firm B is exactly the same. A combination of 
dominant strategies is necessarily a Nash equilibrium. In equilibrium, the maximum share- 
holder surplus provided by the rival auditors is the same for each client; therefore, based 
on Proposition 2, the auditors bid their break-even prices and share clients of each type. 
Note that under the equilibrium revenue-sharing rule «^? = «'(B'?)/K'(B'^), the break-even 
price equals the firm's cost, i.e., f^" = K'(g'?). It follows that in equilibrium each firm earns 
zero economic profit, and thus the partners' participation constraints are satisfied. 


Proof of Proposition 4 


Given the equilibrium sharing rules Bj, = B4, = BS, Ba = Ba = BO, aa, = aj 
= w$ a5 = a5 = aC, i € (1, 2), Conditions (17) and (18) imply that: 


Di KA D$ Ke Ke Ke andpt = KS —v. = Ke Kh KT. (A3) 
Since (A3) holds for all partners within the firm, we have dropped the subscript i. Also 
note that c” = v5" — KS > 0, of = v? —k$Kf/kG,oS$j = vS” — KS" > 0, and 


GÀ = U$ K£ KA. Thus, (A3) can be restated as (A4) or (A5): 
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o1 > of andog > oS. | (A4) 
vs" —o$ > KS and S° — ot > KS”. (A5) 


(A4) and Proposition 2 imply that Firm A (Firm B) audits all the S type (C type) 
clients. Moreover, (A5) and Proposition 2 imply that each firm earns positive economic 
profits from its niche market as shown by Equations (19) and (20), and thus the partners' 
participation constraints are satisfied. 

We now show that such sharing rules constitute an equilibrium. Observe that under 
these sharing rules, the value of audits is maximized for all auditors with respect to their 
clientele, and all auditors are induced to bid in their firms’ interests with respect to 
their clientele. Then, it is straightforward to verify that, given the rival firm's equilibrium 
sharing rules, Firm A (Firm B) cannot earn any economic profits from the C type (S type) 
clients or increase its economic profits from the S type (C type) clients by unilaterally 
changing its own sharing rules for any partner. Hence, Conditions (17) and (18) are suffi- 
cient conditions for these sharing rules to form an equilibrium. 


TABLE 1 
Table of Notation 


$ = probability of a company being a good company; 
V - current liquidating value of a good company; 
V' = end-of-period value of a good company; 
I = current liquidating value of a bad company; 
e = audit effort by engagement auditor; 
h = cooperation/helping effort by an auditor for his partner’s audit engagement; 
g = auditor’s report that the company is good; 
b = auditor’s report that the company is bad; 
p = audit quality, i.e., the (conditional) probability of detecting a bad company; 
C = complex type of client; 
$ = simple type of client; 
t = client type, z € (C, S); 
x = parameter of the audit quality function, u.(), x € (x, x), (C, S} € Q, x); 
I = legal payment made by an audit firm to shareholders in case of litigation; 
f = audit fee; 
f = auditor's break-even price for an audit engagement; 
y = auditor's compensation; 
R = audit fee revenues associated with an auditor's audit engagements; 
= legal liability costs associated with an auditor’s audit engagements; 
auditor's share of fee revenues from his own audit engagements; 
auditor's share of liability costs from his own audit engagements; 
Jump sum transfer between the partners of a firm; | 
engagement auditor's personal cost of an audit; 
K = firm's cost of an audit; 
v — gross benefit of an audit to shareholders; 
c = maximum amount of shareholder surplus an auditor can provide; 
Q = value of an audit; 
II, = economic profit of Firm r € (A, B]; and 
ni = number of f type clients in Location i € (1, 2), nf + n; = n, 


HH d H I 
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ABSTRACT: The ability to adapt decision making to the features of decision tasks and 
contexts Is likely an Important component of professional declsion making, but not all 
professionals will exhibit this adaptivity. One explanatlon for decision makers’ failure to 
adapt when adaptivity is appropriate is that they do not possess relevant knowledge 
of the features of decision tasks and contexts, or institutional knowledge. In this study, 
| examine the relations between institutional knowledge, information search adaptivity, 
and performance using an experimental research design In the tax decision-making 
setting. The results of the study are consistent with predictions. Tax professionals with 
relevant Institutional knowledge responded to the differential features of the tax- 
planning and compliance contexts by conducting broader and more extensive Infor- 
mation search in planning than in compliance; professionals lacking institutional knowi- 
edge did not exhibit such information search adaptivity. In addition, tax research 
performance Increased with information search adaptivity, and that adaptivity mediated 
the relation between institutional knowledge and tax research performance. This study 
extends the accounting and psychology adaptivity literatures and contributes to tax 
practice and education. 


Keywords: information search adaptivity; performance; knowledge; tax decision 
making. 


Data Availability: Contact the author. 


I. INTRODUCTION 
eatures of accounting decision tasks and contexts vary, and tailoring the decision 
process to those features is an appropriate adaptive strategy. However, this adaptivity 
can occur only if decision makers know that decision tasks and contexts differ in 
relevant ways and possess appropriate strategies to deal with those differences (Payne et 
al. 993). Institutional knowledge, or knowledge of the features of tasks and contexts in the 
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professional decision-making setting, is critical to exhibiting the adaptivity that is necessary 
to perform well across a variety of accounting tasks and contexts. Therefore, I predict that 
decision makers who understand the differential features of decision tasks and contexts and 
Whose decision strategies are based on an understanding of those features should perform 
better than do those without such knowledge. In this study, I examine (1) the relations 
between the features of an accounting decision setting and appropriate information search 
strategies, (2) the moderating role of institutional knowledge on information search adap- 
tivity, and (3) the relations of institutional knowledge, information search adaptivity, and 
` performance in a professional accounting task: tax research. 

Tax research includes the processes of issue identification, information search, authority 
selection and documentation, and conclusion formation. The results of the tax research 
process serve as the primary inputs to the advice preparation process. Tax professionals 
serve as client advisers and advocates in an environment characterized by significant ex- 
posure to practice risk (Kadous and Magro 2001), but they can reduce this exposure by 
relying on high-quality tax research. Because tax research is critical to tax decision making, 
it is important to understand how individual characteristics interact with features of tax 
decision contexts to drive tax research performance. Magro (1999) conducted a detailed 
task and context analysis and identified key differences between two primary tax decision- 
making contexts, tax planning and tax compliance.! She demonstrated that the planning 
context is characterized by greater ambiguity, complexity, and justifiability demands than 
is the compliance context. While tax professionals conduct information search using the 
same tools and information sets in both contexts, tax professionals in the planning context 
should respond to these contextual differences by conducting broader and more extensive 
information search than do professionals in the compliance context. I predict, however, that 
institutional knowledge acquired through experience with the contexts moderates the rela- 
tion between context and adaptivity. Specifically, I predict that tax professionals who do 
not possess relevant institutional knowledge will not exhibit the predicted information 
search adaptivity. Further, I predict that the ability to appropriately adapt information search 
to the context is a characteristic of good performance in tax research and that adaptivity 
mediates the relation between knowledge and performance in this setting. 

To test these hypotheses, I conducted an experiment using tax staff and seniors from 
an international accounting firm as participants. Decision context was manipulated within 
participant, and institutional knowledge was measured. As predicted, those with institutional 
knowledge exhibit the predicted adaptivity by conducting broader and more extensive in- 
formation search in planning than in compliance, but those without such knowledge did 
not. Tax research performance increased with adaptivity, and adaptivity mediated the rela- 
tion between institutional knowledge and performance. That is, those who knew that plan- 
ning is characterized by greater ambiguity, complexity, and justifiability demands than is 
compliance conducted broader and more extensive information search in planning, and that 
information search adaptivity contributed to better performance on the overall tax research 
task. 

This study makes three main contributions to the literature. First, the study demonstrates 
that task, context, and individual characteristics combine to drive information search strat- 
egies in a professional setting and that a detailed analysis of the setting is necessary to 


! Tax planning occurs when the client approaches the tax professional for advice before all facts are established 
and the opportunity exists to affect the way a transaction will be structured. Tax compliance occurs when a 
client approaches the tax professional after all facts have been established or when tax considerations will not 
affect the structure of a transaction. 
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predict when and how decision makers will adapt their decision strategies. Second, this 
study is the first to focus on the importance of institutional knowledge to decision-making 
process and performance in accounting. Finally, the results of this study contribute to tax 
practice. The results suggest that information search adaptivity is an important component 
of tax research performance and mediates the relation between institutional knowledge and 
performance. The results of this study suggest that educators and accounting firms can 
improve the education and training of tax professionals by providing opportunities for them 
to acquire institutional knowledge, including knowledge of both task and context features 
and appropriate adaptive strategies. 

In the next section, I develop my theory and hypotheses, and in Section IH I describe 
my experimental method. Section IV contains the results and discussion, and the final 
section contains conclusions, limitations, and directions for further research. 


IIl. THEORY 

Contingent Decision Behavior 
Adaptive Decision Making 

Contingent decision behavior is a term used to describe the phenomenon by which 
decision makers employ varying decision strategies dependent on the situation (Payne et 
al. 1993; Petty and Cacioppo 1986; Beach and Mitchell 1978; Reder 1987; Siegler 1988). 
With their adaptive-decision-maker framework, Payne et al. (1993) make a significant con- 
tribution to our understanding of contingent decision making by drawing on a sizable body 
of research (see Payne et al. [1993] for a review) to develop a coherent theory that predicts 
and explains such behavior. Three principles of the adaptive-decision-making framework 
are that (1) decision strategies have different characteristics, (2) features of the decision 
problem (i.e., task and context features)? the social context of the decision, and the decision 
maker influence the anticipated costs and benefits of decision strategies, and (3) de- 
cision makers attempt to balance cost and benefit concerns when deciding how to make a 
decision. Thus, different individuals may employ different strategies for the same problem, 
or the same individual may select different strategies at different times. While this basic 
framework is valid across decision settings, the features of decision settings and measures 
of adaptivity considered in prior studies may not be meaningful in more naturalistic decision 
settings characterized by complex, unstructured tasks (Klein et al. 1993). 


Features of a Professional Setting: Tax Decision Making 

Accounting decision settings are characterized by features that distinguish them from 
the settings previously studied in the adaptivity literature, including well-educated profes- 
sional decision makers, a multiperson decision-making environment, and the existence of 
a vast body of professional and legal standards that inform the decision process. The tax 
decision-making setting is further distinguished by the. prominence of formalized infor- 
mation search. While important in many decision settings, information search is the primary 
focus of tax research, a fundamental process in tax decision making. The product of the 
tax research process (usually a memo with supporting documentation) serves as the prin- 
cipal input to the client advice-formation process, and forming advice for the client is 
generally the purpose of tax engagements. 

Another important feature of the tax setting is that most decision making occurs in one 
of two decision contexts, tax planning or tax compliance. In planning, the client has not 


z Task features that have been considered include task complexity, response mode, information display, and agenda 
effects. Context features include similarity, quality of the option set, reference points, and framing. 
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yet entered into a transaction and the opportunity exists to affect the way in which the 
transaction is structured. In compliance, the client has either completed the transaction in 
question or is unable or unwilling to alter how a planned transaction will be implemented. 
In her descriptive study of features of tax decision contexts, Magro (1999) shows that 
different information characteristics, goals, responsibilities, and potential costs between 
planning and compliance contexts cause planning problems, in general, to be characterized 
by greater ambiguity, complexity, and justifiability demands than are compliance problems. 
I extend the prior literature by developing and testing predictions about how differences 
across tax contexts and professionals manifest in adaptivity along specific information 
search dimensions and differential tax research performance. 

Ambiguity and complexity. Ambiguity is a state of uncertainty with respect to infor- 
mation, decision alternatives, constraints, and the criteria by which decisions will be judged 
(Beach and Mitchell 1978; Payne et al. 1993). In contrast to the compliance context, a 
specific transaction structure has not been implemented by the client in the planning context 
(in fact, the objective in tax planning is to determine what the facts should be in order to 
achieve the client's tax and economic goals), and the law as well as the client's character- 
istics, resources, and goals can change during the planning process (Magro 1999)? Such 
changes can expand the set of potential transaction structures, render previously appropriate 
structures unusable, or impose different criteria by which the decision will be evaluated. 
lask complexity increases with the number of problem components, the number of criteria 
on which the decision will be evaluated, and the decision's potential influence on future 
events (Beach and Mitchell 1978; Payne et al. 1993). Planning is characterized by more 
problem components and criteria by which decisions will be judged than is compliance. 
The tax professional's goals in planning are to identify transaction structures and tax treat- 
ments that satisfy both the client's overall economic and tax goals subject to statutory and 
client constraints. In compliance, the tax professional’s goal is only to identify appropriate 
tax treatments taking into consideration the client's tax goals and statutory constraints 
(Sommerfeld and Streuling 1981; Causey and McNair 1990). Planning judgments, which 
determine the structure of transactions, also have greater potential to influence future events. 
For example, the choice of business entity may have significant long-term tax and economic 
effects for the client. Together these features make the planning context more ambiguous 
and complex than is compliance. 

Justifiability demands. The need to support one's judgments and decisions is an im- 
portant feature of the social context in which professional decision making occurs. Differ- 
ences in practice risk (Kadous and Magro 2001) drive differences in the justifiability de- 
mands of tax planning and compliance (Magro 1999). Planning work exposes the tax 
professional to greater risk than does compliance work because compliance advice is sub- 
sumed by planning advice. The Statement on Standards for Tax Services (SSTS) No. 8 
(AICPA 2000) states: “In advising or planning with a client on tax matters, the CPA should 
assume that the advice will affect the manner in which the matters or transactions consid- 
ered ultimately will be reported on the client's tax returns. Thus, for all tax advice the CPA 
gives to a client, the CPA should follow the standards in SSTS No. 1 relating to tax return 
positions." As such, a professional who gives bad advice in tax planning may be subject 


3 For example, client facts would change if an S-corporation requested advice, but before implementing a rec- 
ommended transaction structure, lost its S-corporation status. Goals would chenge if a client intended to finance 
an acquisition with cash but other needs for that cash arose, leading them to consider alternative means to 
finance or alternative structures like exchanges or installment sales to effect the acquisition. 
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to all of the costs associated with compliance work* plus the additional civil liability and 
loss of reputation associated with planning malpractice litigation. This additional civil lia- 
bility results because planning advice is more significant for the client than is compliance 
advice—clients may incur significant costs to implement tax planning advice and such 
implementation is likely irreversible with long-term effects that go far beyond the scope of 
taxes (Quattrochi 1982). Compliance advice neither imposes the same potential costs nor 
has the same potential impact on the client. 

A consideration of educational materials, professional assessments of contextual fea- 
tures, and tax malpractice insurance activity supports the contention that tax professionals 
are more vulnerable in planning than in compliance. Tax research textbooks identify plan- 
ning work as riskier for the professional (Karlin 2003) and recommend using disclaimers 
when communicating tax-planning advice to clients (Sommerfeld and Streuling 1981). Ex- 
perienced tax managers in an experimental setting assess the justifiability demands of plan- 
ning problems as higher than those of compliance problems (Magro 1999). While more 
malpractice insurance claims are filed in compliance than in planning; tax professionals 
are exposed to greater expected costs in planning because the costs of settling claims is 
greater. In addition, more malpractice lawsuits are filed in planning, leading to higher 
litigation costs (Ferguson 1996), and cases that make it to trial are likely those that pose 
the greatest risk to the tax professional’s reputation. In the next subsection, I address the 
moderating role of institutional knowledge in the relation between these contextual differ- 
ences and appropriate adaptive responses. 


Institutional Knowledge 

Payne et al. (1993) argue that individuals will not exhibit adaptive-decision-making 
behavior if they fail to (1) identify differences between the decision problems or contexts, 
(2) recognize that an adaptive response to identified differences is appropriate, or (3) gen- 
erate appropriate adaptive strategies in response to identified differences. Thus, adaptivity 
is only expected when the decision problems or contexts differ and the decision maker 
possesses knowledge of relevant differences, the appropriateness of an adaptive response, 
and appropriate adaptive strategies. As such, any test of adaptivity is really a joint test of 
the three conditions. 

The importance of knowledge to adaptive decision making is not surprising as knowl- 
edge is well established as a key component of performance (Libby and Luft 1993; Libby 
1995). Types of knowledge relevant to accounting decision making include technical knowl- 
edge (Cloyd 1995b, 1997; Spilker 1995), tacit knowledge (Libby and Tan 1997), client and 
general business knowledge (Bell et al. 1997), and institutional knowledge (Magro 1999). 
Particularly important in the tax setting are knowledge of the business setting, knowledge 
of the client, technical tax knowledge (i.e., knowledge of the tax law as it applies to specific 


^ jn the compliance context, the tax professional may (1) be subject to preparer penalties (generally in the $250— 
$1,000 range) if she completes the return, (2) be liable for client interest or client penalties (usually a percentage 
of the additional tax due), and (3) face loss of reputation. 

> Although almost half of the tax claims to the AICPA-endorsed professional liability insurer are for compliance 
issues such as math or filing errors (Stevens 2000), these constitute frequent, but relatively low-cost, claims that 
are usually resolved out of court (Kaban 1999). Such claims likely are not associated with great loss of reputation, 
which tax professionals cite as the most important cost they face (Collins et al. 1990). 

$ In a personal communication, Joseph Wolfe, Director of Risk Management for the AICPA-endorsed professional 
insurance program, indicates that the average cost of defending and settling claims is about three times greater 
for tax-planning than for tax compliance claims. Tax-planning claims are frequently settled for over $100,000, 
while most tax compliance claims cost less than $50,000 to defend and settle. 
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transactions), and institutional. tax knowledge (i.e., knowledge of the tax decision-making 
environment). While each of these types of knowledge likely has different effects on tax 
professionals' judgments and decision making, prior research in tax has focused largely on 
technical tax knowledge. Possession of such knowledge has been shown to have a positive 
effect on identification of tax issues (Bonner et al. 1992), assessment of distinguishing 
features of tax cases (Davis and Mason 2003), recommendations of reporting positions 
(Cloyd 1995a; Roberts and Klersey 1996), information search under accountability demands 
(Cloyd 1995b, 1997), and keyword selection (Spilker 1995). 

Several prior studies have investigated the effects of contextual features on tax decision 
making (cf., Cloyd 1995b, 1997; Kadous and Magro 2001; Alexander 2003), but only one 
prior study has investigated the effects of institutional knowledge on tax professionals’ 
judgment and decision making. Spilker (1995) addressed the effects of declarative knowl- 
edge (operationalized as technical knowledge), procedural knowledge (operationalized as 
tax research skill), and time pressure on keyword selection. The possession of a domain- 
specific skill, such as the ability to use a tax database to complete tax research, is an element 
of knowledge about the decision setting and, as such, a type of institutional knowledge. 
Participants with both technical and institutional knowledge identified more relevant key- 
words than did those with only technical knowledge, regardless of time pressure, and their 
performance advantage increased in the presence of time pressure. Spilker's (1995) study 
demonstrates that institutional knowledge can be an important component of performance 
in a tax task and that technical and institutional knowledge can have distinct effects on 
performance. 

An important element of institutional knowledge in tax is knowledge of the differential 
contextual features of planning and compliance. Tax research begins with the identification 
of issues based on the client's facts. Using these issues as a starting point, the tax profes- 
sional conducts information search to identify authorities that inform these issues and to 
meet professional and statutory requirements. This process is iterative with the set of issues 
and relevant facts refining or expanding during the information search process. From the 
authorities accessed during information search, the tax professional selects authorities to 
include in the communication of conclusions. 

Because tax planning is characterized by greater ambiguity, complexity, and justifia- 
bility demands than is compliance, the tax professional has incentives to employ different 
information search strategies in planning and compliance to achieve desired levels of de- 
cision quality. Two relevant dimensions of information search in tax are extent and breadth,’ 
where extent describes how much search is done and breadth describes which authorities 
are accessed during search. The authorities accessed during search can be described along 
several dimensions including the content of the authority, the level of authoritative support 
provided by the authority, or the type of authority. Specific contextual differences across 
planning and compliance are more likely to result in differences in one dimension of breadth 
than another. Specific predictions with respect to the relations between ambiguity, com- 
plexity, and justifiability demands and information search breadth and extent are developed 
below. 

Planning is more ambiguous and complex than is compliance and is therefore charac- 
terized by more problem components and decision criteria and by greater uncertainty and 
potential impact on future events. In response to these differences, tax professionals should 


? Some prior research on informationsearch in accounting characterized information search as either directed or 
sequential (Biggs et al. 1988). While information search in tax could be described similarly, there is no theoretical 
basis for predicting differences in the directedness of search across the planning and compliance contexts. 
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access authorities with different content and level of authority during information search. 
First, the professional should access a set of authorities that address multiple issues and 
transaction structures.š Further, tax professionals can use secondary sources? that summarize 
various types of primary authorities to gain a better overall understanding of the law without 
accessing those sources directly. In the compliance context, these types of search have less 
value because the decision problem is delineated with respect to one transaction and its 
reJated issues, and the tax professional knows that the facts and relevant law will not change. 

This analysis suggests that ambiguity and complexity differences affect the focus of 
search across planning and compliance contexts by making a broad search with respect to 
the content of the authorities and their level of authority more valuable in planning. Decision 
makers, however, cannot exhibit such information search adaptivity without knowledge of 
the contextual differences that make the adaptivity appropriate. The moderating role of in- 
stitutional knowledge in the relation between contextual features and information search 
adaptivity is formalized in the following hypothesis. 


H1: Tax professionals who know (do not know) that the planning context is character- 
ized by greater ambiguity and complexity than is the compliance context will (will 
not) conduct broader information search in planning than in compliance. 


À. similar argument can be made with respect to justifiability demands in planning and 
compliance. Given the increased exposure to practice risk in the planning context, the tax 
professional should make greater efforts to identify plans with high levels of support and 
to accurately assess the risks of those plans to satisfy the increased justifiability demands. 
To meet statutory and professional support requirements, the tax professional is required to 
consider all types of primary authorities (i.e., IRC sections, Regulations, cases, Revenue 
Rulings, etc.) and to identify all relevant authorities before advising the client. The con- 
sequences of failing to do so are greater in planning than in compliance, so the tax pro- 
fessional has an incentive to conduct both a more extensive search by considering more 
authorities (to meet requirements to identify all relevant authorities) and a broader search 
by considering more types of authorities in planning (to meet requirements to consider all 
primary authority types). While the increased ambiguity and complexity of planning is 
predicted to result in the use of more secondary sources that summarize multiple primary 
authorities, the increased justifiability demands of the planning setting should lead the tax 
professional to directly access the primary sources as well. Tax professionals, however, 
cannot respond to the increased justifiability demands in planning unless they know that 
such demands exist. The prediction that institutional knowledge moderates the relation 
between the context and information search adaptivity leads to the following hypothesis. 


H2: Tax professionals who know (do not know) that the planning context is character- 
ized by greater justifiability demands than 1s the compliance context will (will not) 
conduct broader and more extensive information search in planning than in 
compliance. 


8 For example, if a taxpayer wished to transfer stock to her child, she could structure the transaction as a gift, a 
sale, an exchange, or a bequest. Each of these would be considered a different transaction structure. 

? Secondary sources are those that provide no authoritative support in litigation and cannot be relied upon when 
assessing the strength of support for a position. Primary authorities are those upon which the tax professional 
may rely. 
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Tax Research Performance 


The tax research process is important because the products of that process (often a 
memo and supporting documentation in the multiperson decision setting) serve as the pri- 
mary inputs to the advice-formation process. Documentation of the research process is also 
critical when defending the firm in litigation or against challenges by the IRS. Tax managers 
and partners at an international accounting firm report that, when evaluating tax research 
quality, they consider the (1) quality of the underlying information search process (com- 
prehensiveness, efficiency, and authorities selected), (2) quality of authority analysis, (3) 
quality of documentation, (4) strength of arguments, (5) level of authoritative support for 
positions recommended, and (6) quality of the writing (interviews at an international ac- 
counting firm). 

Information search adaptivity in response to contextual differences should be associated 
with tax research performance because it impacts the quality of the underlying information 
search process and the level of authoritative support for the positions recommended. For 
example, failing to conduct an extensive search in the planning context would affect the 
perceived comprehensiveness of information search. Further, failing to conduct a broad 
search in the setting would impact the specific authorities selected (a characteristic of the 
quality of the search) and thus the authoritative support for the positions recommended. As 
a key determinant of information search adaptivity, institutional knowledge of contextual 
features should have an indirect effect on tax research performance through its effect on 
information search adaptivity. | 


H3: Tax research performance increases with information search adaptivity. 


H4: The effect of institutional knowledge of ambiguity, complexity, and justifiability 
demands differences across planning and compliance contexts on tax research per- 
formance is mediated by information search adaptivity. 


Ill. RESEARCH METHOD 
Task 


The experimental instrument contained two tax research assignments. Each assignment 
provided information about a different client and included a brief history of the client's 
business, a discussion of pertinent relationships, and a description of a completed or planned 
transaction. Participants were instructed to conduct tax research, document those sources 
they wished to submit, and convey the results of their research in a memo to the tax 
manager. Participants were allowed to use only the CCH ACCESS database for research 
and were limited to two hours to complete each assignment (total experiment length was 
4.5 hours including the post-experiment questionnaire).’° Participants completed the re- 
search cases while working at their own computers during a designated block of time. Two 
administrations of the experiment were conducted. As analysis of the data shows no effect 
of group, they are combined in all further analyses. 


Participants 
Twenty-eight staff- and senior-level tax professionals employed by an international 


accounting firm participated in the experiment. Tax professionals at this level complete a 
significant majority of the tax research assignments in the firm. Participants were volunteers 


'? Based on interviews at an international accounting firm, initial investigations of research issues generally last 
three to four hours. I limited research to two hours for each project due to participant availability constraints. 
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recruited by a firm partner. Participants’ tax experience ranged from eight to 42 months 
with a mean (SD) of 16.9 (6.9) months. Nineteen participants had advanced degrees, in- 
cluding 12 J.D.s. Demographic data suggest that participants spend a significant proportion 
of their chargeable hours on tax research, are familiar with the CCH ACCESS database, 
and conduct research related to both planning and compliance problerms.!! Table 1 presents 
selected demographic data by high and low institutional knowledge group. 


Iucentives 


Incentives were provided both to encourage participation and to motivate participants 
to take the project seriously. To encourage participation in the study, volunteers received 
training hours for their participation and were entered into a lottery for two prizes (a pair 
of tickets to see a professional basketball game or $250). The lottery was run separately 
for each administration of the experiment. To motivate them to take the project seriously, 
participants were informed that firm managers would evaluate their research products (re- 
search memo and supporting documentation) and that 25 percent of the participants would 
be contacted later to explain their work. The possibility of being held accountable for their 
work products is representative of the natural setting in which staff submit their work 
products to a supervisor who evaluates them. To maintain the integrity of the experimental 


TABLE 1 
Mean (SD) Demographic Variables 


Panel A: Knowledge of Ambiguity and Complexity Differences 


AMB/COMP. KNOW* JUST. KNOW 
High Low High Low 
n = 18 n — 10 n — 10 n = 18 
Experience (months) 17.64 14.40 20.40 14.28 
(8.100) (3.096) (9.876) (3.012) 
J.D. (percentage of participants) 50.00 30.00 60.00 33.33: 
Percentage of time on planning work 57.33 23.00*** 58.20 37.78 
(23.522) (20.575) (23.218) (27.930) 
Percentage of time on compliance work 34.89 57.00** 33.80 47.78 
(19.041) (15.312) (18.558) (20.380) 
Exposure to partnership problems 55.56 40.0 50.00 50.00 
(percentage of participants) 
Exposure to S-corporation problems 50.0 40.0 50.00 44.44 
(percentage of participants) 
Exposure to both technical topics 44.44 40.0 40.00 44.44 


(percentage of participants) 


* ** *** p < 0.05, p < 0.01, and p < 0.001, respectively. 

* AMB/COMP... KNOW is the measure of institutional knowledge of ambiguity and complexity differences 
across planning and compliance. 

b JUST..KNOW is the measure of institutional knowledge of justifiability demand differences across planning 
and compliance. 


U Participants reported that they spend, on average, 34.1 percent of their chargeable hours on tax research. Reports 
ranged from 5 percent to 90 percent, with the two most senior participants reporting the lowest percentages. 
Participants reported that they use the CCH ACCESS database for 86 percent of their tax research assignments 
on average and that their total tax research time is allocated between planning (53 percent), compliance (41 
percent), and other research (6 percent). 
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Instructions, managers actually evaluated staff work products, and I contacted seven partic- 
Ipants to request clarification or additional Information. 


Independent Variables 
Context 

Context was manipulated within-participant at two levels (planning and compliance) 
by changing the timing of the client's request for advice. In the compliance condition, the 
transaction had already occurred while in the planning condition the client was considering 
the transaction. The planned transaction had negative tax effects that could be avoided by 
restructuring the transaction. Each participant completed one research assignment that in- 
volved the sale of a partnership interest and one that addressed a distribution by an S- 
corporation to a shareholder. Half of the participants did the partnership problem in the 
planning context and the S-corporation problem in the compliance context, and the other 
half did the partnership problem in the compliance context and the S-corporation problem 
in the planning context. The order of the problems was manipulated, yielding four condi- 
tions. Order is not significant, and the data are collapsed across order groups in all reported 
analyses. Whether in the planning or compliance contexts, the materials identified only one 
potential structure for the transaction. 


Institutional Tax Knowledge 

I created three dichotomous variables (AMB.. KNOW, COMP .KNOW, and JUST . 
KNOW) representing the participant's knowledge of ambiguity, complexity, and justifiability 
demand differences across tax planning and compliance contexts. In the post-experimental 
questionnaire, participants were asked to assess the complexity, ambiguity, and justifiability 
demands of planning and compliance problems. Each feature was defined!? then immedi- 
ately followed by a question of the form: “In general, how ambiguous are compliance 
problems?" ? For the ambiguity question, the endpoints of the seven-point Likert response 
scale were anchored by “Not at all Ambiguous” and “Completely Ambiguous.” These 
questions were embedded in an 82-item questionnaire that contained 59 items on which 
they responded using Likert scales. The three questions related to planning were grouped 
together and were separated by approximately 20 questions from the group of three com- 
pliance questions. 

To control for the fact that individuals may use Likert scales differently, participants 
also responded to a question at the end of the questionnaire that was designed to identify 
how they used the scale. Participants were asked to place a vertical line through the point 


12 I did not define the terms ''planning" and "compliance" for participants because they are introduced in all 
leading introductory tax texts, and tax professionals generally share a uniform definition of these terms. To the 
extent that the participants in this study did not share a common definition of these terms, noise but not bias is 
introduced into the data. 

P? I use institutional knowledge measures based on assessments of planning and compliance contexts in general 
rather than using questions assessing the specific planning and compliance cases used in the study because, 
while the perceived features of the specific problems being researched might affect information search behaviors, 
information search strategy in initial research is likely driven by ex ante beliefs about the features of the contexts 
in general. Further, the cases were carefully selected to be similar in ambiguity, complexity, and justifiability 
demands, so any assessed differences across case by the participants should be attributable to differences in 
context. Results are qualitatively the same using knowledge measures constructed from case-specific questions. 
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on the scale that delineated the smallest meaningful difference from a designated point.14 
Institutional knowledge was considered high (value of 1) when the difference between the 
participant's ratings for the two contexts exceeded this self-reported meaningful difference 
on the rating scale. I combined AMB. KNOW and COMP_KNOW to form AMB/COMP.. 
KNOW, which takes on a value of 1 when either AMB. KNOW or COMP_KNOW is high." 
Using these methods, 18 participants were classified as high AMB/COMP_KNOW and ten 
were classified as high JUST. KNOW.!9 I test H1 and H2 with AMB/COMP. KNOW and 
JUST .KNOW. 

To test the mediating role of institutional knowledge in the relation between adaptivity 
and performance predicted in H4, I created a measure of overall institutional knowledge, 
INST..KNOW, by summing AMB/COMP_KNOW and JUST. KNOW. Because AMB/ 
COMP..KNOW and JUST. .KNOW each take on a value of 0 or 1, INST. KNOW takes on 
a value of 0, 1, or 2. 


Information Search Adaptivity 

I developed an overall measure of adaptivity to test H3 and H4, which address the 
effects of information search adaptivity and institutional knowledge on performance. I cal- 
culated z-scores for each of the five search dimensions used as the dependent variables in 
tests of H1 and H2 (described below) and computed the difference for each across the 
planning and compliance contexts. I then summed the difference scores across the five 
dimensions to yield the variable ADAPTIVITY. 


Dependent Variables 
Information Search Adaptivity 

The CCH ACCESS PATH function keeps a complete record of all research steps on a 
project, and its printout cannot be altered without detection. Using the participants' PATHs, 
I compiled a comprehensive list of sources accessed by the participants for each problem. 
Participants, in total, identified 354 sources in one problem and 283 sources in the other." 
I classified each source along the following dimensions: (1) transaction structure addressed, 
(2) type of authority (e.g., IRC section, Regulation, Revenue Ruling, or CCH Explanation), 
and (3) issue addressed. À second rater with ten years' tax experience classified each source 
with respect to the subjective measures, transaction structure, and issue, for a sample of 
127 of the 637 authorities. Each of the sources was classified without rater knowledge of 


14 Participants received a seven-point Likert scale anchored by 1 and 7 with a vertical line (labeled “A”) drawn 
through the scale at 3. The instructions read, “You answered several questions using scales like the one below. 
To help us understand how you used this scale, please place a vertical line at the point that marks the smallest 
important difference in ratings from the indicated line ‘A.’ That is, when evaluating your responses to the 
questions on the preceding pages, we should consider the ratings of two previous items significantly different 
if the between your marked lines on those items is at least as great as the distance between line ‘A’ 
and your mark on the scale below.” 

15 While raw ambiguity and complexity differences between planning and compliance varied within subject, all 
subjects with high AMB_KNOW also had high COMP_KNOW. AMB_KNOW and COMP_KNOW were com- 
bined because ambiguity and complexity are predicted to have similar effects on the search dimensions measured 
in this study. 

16 One participant in each low-knowledge group actually assessed greater ambiguity and complexity or justifiability 
demands for compliance than for planning. If these participants are reclassified as high-knowledge with their 
results reverse-scored, the results are qualitatively the same. 

17 I count as a source the individual units available through CCH ACCESS. For example, one paragraph of an 
Internal Revenue Code section, multiple paragraphs of an Internal Revenue Code section, a case, or an entire 
tax bill each would be designated as one source if CCH has created the access to the source in that size chunk. 
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the condition in which it was generated or the subject who identified it. Inter-rater reliability 
averaged 88.9 percent and ranged from 87.3 percent to 91.2 percent.!* 

Search breadth is a multidimensional theoretical construct that was operationalized in 
this study as number of issues, number of transaction structures, percentage of secondary 
sources, and number of authority types. While other measures of search breadth in tax may 
exist, these measures are particularly suited to the contextual features manipulated herein. 
Number of issues and number of transaction structures were measured as the number of 
unique issues and transaction structures, respectively, addressed in the sources accessed by 
a participant. Percentage of secondary sources was calculated by determining the authority 
type of each source, coding the sources as primary or secondary based on definitions in 
tax texts, and calculating the percentage of secondary source authorities in each participant's 
authority set. This percentage was transformed using the arcsin transformation for statistical 
testing. Percentage rather than number of secondary sources was used because it measures 
the emphasis on secondary sources rather than just the number of such sources. For ex- 
ample, one would interpret the fact that a participant accessed five secondary sources dif- 
ferently if the participant accessed a total of ten authorities than if he accessed 30. Number 
of authority types was measured as the number of different authority types accessed by a 
participant. 

Extent of search is also a multidimensional theoretical construct but is operationalized 
here with a single measure, number of authorities accessed. While time of search would 
be another appropriate measure of the extent of search in tax, participants in this study 
were under significant time pressure and all used the full time available to them to complete 
the research.. Number of sources was measured as the total number of sources accessed 
minus the number of repeat hits. 

These search measures were used as the dependent variables to test the hypotheses 
relating context and institutional knowledge to adaptivity. Consistent with the theory, I test 
H1 with number of issues, number of transaction structures, and percentage of secondary 
sources. Number of authority types would not be an appropriate measure of breadth here 
because increased complexity would not increase the value of considering both Revenue 
Rulings and case law and increased ambiguity would be addressed by considering more 
secondary sources that summarize multiple authority types. Consistent with the theory with 
respect to justifiability demands, I test H2 with number of authority types (breadth of 
search) and number of authorities (extent of search). 


Tax-Research Performance 

The performance measure for H3 and H4 is a continuous variable (QUALITY) based 
on the average assessed quality of each participant's tax-research products (i.e., the planning 
and compliance memos). The tax manager (a CPA with a J.D., an M.S. in accounting, and 
an L.L.M. in tax) was from the same accounting firm as the participants and had significant 
accounting (14.6 years) and tax (6.1 years) experience. À second rater also evaluated the 
research products of the participants. This rater was a doctoral student with 17 years' tax 
experience who had been a Big 5 tax manager and a sole practitioner. Both were unaware 
of the research hypotheses, and evaluations were blind. The two evaluators’ assessments 
are highly correlated (r = 0.901). Reported analyses are based on the tax manager's 
assessments. 


!3 Inter-rater agreement on the S-corporation problem was 87.3 percent for issues and 87.3 percent for transaction 
structures. Inter-rater agreement for the partnership problem was 89.7 percent for issues and 91.2 percent for 
transaction structures. 
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Each evaluator received a packet containing the following: (1) instructions and demo- 
graphic questions, (2) research assignments, completed memos, and supporting documen- 
tation, and (3) pre-numbered rating sheets. The instructions included a description of the 
conditions under which the staff members were to complete the research and instructions 
to “evaluate the staff member's research as if you had made the assignment and the staff 
member had submitted the work for your use." The memos were grouped by decision 
context and technical topic (e.g., all partnership compliance memos were grouped together) 
with memos arranged randomly within each group. The evaluators rated quality on rating 
sheets that included a 101-point response scale (anchored at 0 by "Lowest Possible Quality" 
and at 100 by “Highest Possible Quality") and space for additional comments or rationale. 


Research Cases 


To control for differences other than context that could cause information-search vari- 
ance across research assignments, I selected two research assignments from those used in 
Magro (1999). Magro (1999) developed planning and compliance versions of six research 
assignments based on the Price Waterhouse Case Studies in Tax (Jones 1994). In her study, 
20 managers assessed the ambiguity, complexity, and justifiability demands of six cases in 
either the planning or compliance context. The two cases used in the present study were 
chosen from this set based on managers’ assessments that the two cases were not signifi- 
cantly different on ambiguity, complexity, and justifiability demands within context (lowest 
p = 0.242). While managers' assessments of these features did not differ, the participants 
in this study might perceive the features of the problems differently. As such, I also con- 
sidered the participants' assessed ambiguity, complexity, and justifiability demands of the 
specific research problems in the study. Similar to the results for the managers, participants” 
assessments of the features did not differ within context for the partnership and S- 
corporation problems (smallest p — 0.417). Means by problem and context for each group 
are presented in Table 2. As the assessments of ambiguity, complexity, and justifiability 
demands do not vary with assignment topic, differences in information search between the 
planning and compliance contexts in this study can be attributed to differences in the 
complexity, ambiguity, and justifiability demands between the contexts themselves and not 
to differences in the characteristics of the specific assignments. 


Manipulation Check 


A manipulation check for the context variable (planning/compliance) was included in 
the post-experimental questionnaire. Participants were asked two questions of the form, “In 
the (partnership) assignment, did the sale of the partnership interest occur before the client 
sought your firm's advice?" An answer of “No” in the planning condition and "Yes" in 
the compliance condition was interpreted as a successful manipulation. All participants 
answered both questions correctly. In addition, participants' assessments of ambiguity, com- 
plexity, and justifiability demands for the planning cases exceeded those for the compliance 
cases (all p « 0.001). 


IV. RESULTS AND DISCUSSION 
Hypothesis 1 
In H1, I predict that tax professionals who possess the institutional knowledge that 
planning is characterized by greater ambiguity and complexity than is compliance will 
conduct a broader search in planning than in compliance, and those without such knowledge 
will not. Results of a repeated measures MANOVA (Khattree and Naik 1999) with context 
and AMB/COMP.. KNOW, the measure of institutional knowledge, as independent variables 
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TABLE 2 
Mean (SD) Assessments of Context Features by Technical Topic 


Panel A: Managers (Magro 1999) 


Context Partnership S-Corporation t 
Ambiguity Planning 4.14 4.49 t = 0.91 
(1.047) (1.288) 
Compliance 2.79 3.06 t; = 1.25 
(0.990) (0.735) 
Complexity Planning 5.28 5.29 t; = 0.03 
(0.755) (0.731) 
Compliance 4.22 4.30 t = 0.15 
(1.451) (0.906) 
Justifiability Demands Planning 4.76 4.75 t, = —0.04 
(0.985) (0.997) 
Compliance 4.28 4.05 t, = —0.57 
(1.293) (0.826) 
Panel B: Tax Staff and Seniors 
Ambiguity Planning 5.01 4.92 t, = —0.28 
(0.901) (0.690) 
Compliance 3.94 3.71 t, = 0.56 
(0.813) (1.333) 
Complexity Planning 5.03 5.14 bs = 0.39 
(0.863) (0.676) 
Compliance 4.23 4.13 bs = 0.25 
(0.884) (1.206) 
Justifiability Demands Planning 5.53 5.69 tiga = 0.56 
(0.997) (0.465) 
Compliance 4.91 4.63 ts = 0.82 
(0.753) (1.016) 


* ** *+** 5 < 0.05, p < 0.01, and p < 0.001, respectively. 


and number of issues, number of transaction structures, and percentage of secondary sources 
as dependent variables suggest that knowledge and context have an interactive effect on 
breadth of search (F, 25 = 4.42, p = 0.045). Tests of simple effects for the breadth measures 
show that in all cases search is broader in planning than in compliance for the high- 
knowledge group but not for the low-knowledge group. Descriptive statistics are presented 
in Table 3. For the high-knowledge group, search is broader in planning than in compliance 
for number of issues (t,, = 4.48, p < 0.001), number of transaction structures (t,, = 3.79, 
p = 0.002), and percentage of secondary sources (tj, = 3.72, p = 0.002). For the low- 
knowledge group, search is not broader in planning than in compliance for number of issues 
(t, = 0.62, p = 0.550), number of transaction structures (t, = 0.80, p = 0.445), or percentage 
. Of secondary sources (t, = 0.61, p = 0.558). 


Hypothesis 2 

In H2, I predict that search will be broader and more extensive in planning than in 
compliance when people know that planning is characterized by greater justifiability de- 
mands than is compliance. Without such knowledge, planning and compliance search will 
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TABLE 3 
Breadth of Search by Knowledge of Ambiguity and Complexity (A4MB/COMP. KNOW?) 
Panel A: Number of Issues 
Planning Compliance t 
High AMB/COMP..KNOW (n = 18) 
Mean 13.11 6.83 = 448*** 
(SD) (6.07) (3.95) 
Low AMB/COMP KNOW (n = 10) 
Mean 14.80 13.70 t, = 0.62 
(SD) (6.03) (4.30) 


Panel B: Number of Transaction Structures 
High AMB/COMP ..KNOW (n = 18) 


Mean 7.78 3.83 t, = 3.79** 
(SD) (4.82) (2.07) 

Low AMB/COMP_KNOW (n = 10) 
Mean 8.60 7.30 ty = 0.80 
(SD) (4.74) (2.79) 


Panel C: Percentage of Secondary Sources 
High AMB/COMP..KNOW (n = 18) 


Mean 54 24 try = 3.72** 
(SD) (.13) (11) 

Low AMB/COMP_KNOW (n = 10) 
Mean 0.61 56 t = 0.61 
(SD) (.04) (.09) 


*, **, *** p < 0.05, p < 0.01, and p < 0.001, respectively. 
« AMB/COMP_ KNOW i is the measure of institutional knowledge of ambiguity and complexity differences across: 
planning and compliance. 


not differ. Here, breadth is measured by the number of authority types, and extent is mea- 
sured by the number of sources accessed. Descriptive statistics by JUST_KNOW are pre- 
sented in Table 4. A repeated measures ANOVA with JUST. KNOW and context as inde- 
pendent variables reveals a significant knowledge by context interaction for breadth of 
search (F,5; = 6.55, p = 0.017). The high-knowledge group used significantly more au- 
thority types in planning than in compliance (t, = 3.34, p = 0.009), and the low-knowledge 
group did not (t,, = 0.81, p = 0.428). Similar results obtain for extent of search. The 
repeated measures ANOVA reveals a significant knowledge by context interaction for extent 
of search (F, 2s = 6.01, p = 0.021). The high-knowledge group accessed significantly more 
authorities in planning than in compliance (t; = 4.37, p = 0.002), but the low-knowledge 
group did not (t,, = 1.55, p = 0.139). 


Hypotheses 3 and 4 

In H3, I predict that information search adaptivity (in the form of broader and more 
extensive search in planning than in compliance) is a characteristic of performance in the 
tax research task. To test this prediction, I regressed the summary measure of adaptivity, 
ADAPTIVITY, on the performance measure, QUALITY. ADAPTIVITY is significant in the 
regression (F,., = 10.60, p = 0.003, R? = 0.29)—performance in the research task in- 
creased with information search adaptivity. 
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TABLE 4 
Breadth and Extent of Search by Knowledge of Justiflability Demands (JUST KNOW") 


Panel A: Number of Source Types 





Planning Compliance t 
High JUST. KNOW (n = 10) 
Mean 5.60 3.00 t, 3.34** 
(SD) (2.46) (1.33) 
Low JUST. KNOW (n = 18) 
Mean 4.61 4.22 tj; = 0.81 
(SD) (2.20) (1.59) 
Panel B: Number of Sources 
High JUST_KNOW (n = 10) 
Mean 25.40 13.80 t, = 437** 
(SD) (9.66) (7.10) 
Low JUST_KNOW (n = 18) 
Mean 22.83 19.61 tj, = 1.55 
(SD) (12.70) (9.93) 


*, ** *** p < 0.05, p < 0.01, and p < 0.001, respectively. 
* JUST .KNOW is the measure of institutional knowledge of justifiability demand differences across planning 
and compliance. 


In H4, I predict that information search adaptivity mediates the relation between insti- 
tutional knowledge and tax research performance. Baron and Kenny (1986) outline four 
conditions that must exist for mediation to be present. First, the initial variable (here, 
institutional knowledge) must be correlated with the outcome variable (here, performance). 
Second, the initial variable must be correlated with the mediator variable (here, adaptivity). 
Third, the mediator variable must be correlated with the outcome variable. Finally, when 
controlling for the effect of the mediator variable, the effect of the initial variable on the 
outcome variable either becomes insignificant (complete mediation) or decreases signifi- 
cantly (partial mediation). To test this predicted relationship, I ran regressions to test each 
of the four necessary conditions (Baron and Kenny 1986). The regressions are reported in 
Table 5. 

A regression with tax research performance, as measured by QUALITY, as the depen- 
dent variable shows a significant effect of institutional knowledge, as measured by INST. 
KNOW (t — 2.18, p — 0.039). A regression with information search adaptivity, as measured 
by ADAPTIVITY, as the dependent variable shows a significant effect of INST. KNOW (t 
= 2.71, p = 0.012). A regression with QUALITY as the dependent variable shows a sig- 
nificant effect of ADAPTIVITY (t = 3.26, p = 0.003). In a regression with QUALITY as 
the dependent variable and ADAPTIVITY and INST. KNOW as the independent variables, 
INST .KNOW is not significant (t — 0.96, p — 0.347). Information search adaptivity com- 
pletely mediates the relation between institutional knowledge and tax research performance 
(Baron and Kenney 1986). 


Alternative Explanations 


If technical knowledge, such as the knowledge of the rules governing partnerships, has 
the same impact on information search as does institutional knowledge and if technical 
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TABLE 5 
Regressions for Mediation Analysis 


Dependent Independent Parameter Standard 





Regression — Variable — — Varisble(s) „Estimate — Error — tvaue | Pr > 
l QUALITY* INST. KNOW* — 8.025 3.686 2.18 — 0.0387 
2 ADAPTIVITY — INST .KNOW 0.545 0.201 271 0.012 
3 QUALITY ADAPTIVITY 9.467 2.907 326 0.003 
4 QUALITY INST KNOW 3.673 3.831 0.96 0.347 
ADAPTIVITY 7.983 3.297 242 0.023 


* QUALITY is the average quality of the planning and compliance memo. 

P INST _ KNOW is the sum of the dichotomous variables AMB/COMP _ KNOW and JUST... KNOW. 

° ADAPTIVITY is the standardized sum of planning and compliance differences across the five measures of 
information search breadth and extent. 


knowledge covaries with institutional knowledge, then failing to control for technical knowl- 
edge would make it impossible to disentangle the effects of the two types of knowledge.!? 
While I did not directly measure technical knowledge of partnership and S-corporation law, 
I collected data with respect to technical-topic education and experience. Exposure to these 
technical topics through education or experience represents the opportunity to acquire tech- 
nical knowledge (Bonner et al. 1992). Half the participants reported prior exposure to at 
least one of the topics in class, training, or practice. 

I first consider the relation of exposure to the technical topics and the general institu- 
tional knowledge measures, AMB/COMP ..KNOW and JUST. KNOW. The 24 frequency 
tables generated by crossing the institutional knowledge and exposure measures show no 
significant patterns (Fisher's Exact Test, lowest p — 0.284). The results of crossing insti- 
tutional knowledge and exposure to the technical topics suggest that technical knowledge 
is not a correlated omitted variable in this study. To further assuage concerns that technical 
knowledge rather than institutional knowledge is driving the results, I ran repeated measures 
MANOVAs with exposure to the technical topic and context as the independent variables. 
For the MANOVAs using exposure to either technical topic, exposure does not interact 
with context for breadth of search (F,,, = 0.16, p = 0.690) or extent of search (F, 5, 
= 0.03, p = 0.869). Similar results obtain for the MANOVAs using exposure to both 
technical topics as an independent variable: exposure does not interact with context for 
breadth of search (F, 2s = 0.03, p = 0.863) or extent, of search (F, 2 = 0.05, p = 0.831). 


V. CONCLUSION 
This study explores the effects of task, context, and decision-maker characteristics on 
information search adaptivity and the relations between institutional knowledge, information 
search adaptivity, and tax research performance. The results are consistent with predictions 
and suggest that tax decision makers respond to features of the tax decision context when 
conducting information search. Consistent with Payne et al.’s (1993) theory that individual 
differences interact with task and context differences to drive adaptivity, those participants 


19 Harlier, I addressed the issue of the relation between technical and institutional knowledge from a theoretical 
standpoint. Here, I report results with respect to the empirical relation between the two constructs in this 
participant group. | 
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who possessed knowledge of differences in ambiguity, complexity, and justifiability de- 
mands across the planning and compliance contexts exhibited the predicted information 
search adaptivity while those without such knowledge did not. Those who knew that plan- 
ning is characterized by greater ambiguity and complexity than is compliance investigated 
more transaction structures, more issues, and a greater percentage of secondary sources in 
planning than in compliance. Those who knew that planning is characterized by greater 
justifiability demands than is compliance used more authority types and accessed more 
authorities in planning than in compliance. Further, consistent with Libby and Luft (1993), 
institutional knowledge is associated with tax research performance as measured by the 
quality of tax research memos and supporting documentation. Finally, the ability to adapt 
information search to the context completely mediates the relation between institutional 
knowledge and tax research performance. 

While I made theory-based predictions that the search of tax professionals without 
institutional knowledge would not differ across the contexts, I did not have theoretical 
grounds to predict what level of search breadth and extent they would exhibit. Interestingly, 
low-knowledge participants' search in both contexts looked most like high-knowledge par- 
ticipants' planning search. That is, low-knowledge participants conducted relatively broad 
and extensive searches in both contexts. The participants' assessed ambiguity, complexity, 
and justifiability demands are generally consistent with this information search pattern. If 
tax professionals conduct information search that is appropriate to planning problems while 
working in the compliance context, then both efficiency and effectiveness may be compro- 
mised. To the extent that search is more extensive than is appropriate for the compliance 
context, efficiency is impacted. This is particularly important as managers in Magro (1999) 
indicated that they would budget significantly less time for compliance research than for 
planning. Effectiveness may be compromised as well because broader search is different in 
focus, and such a search may focus on issues and transaction structures that are relevant 
in a planning context and fail to research relevant compliance issues and transaction struc- 
tures in appropriate depth. 

Limitations of this study provide opportunities for future research. While my partici- 
pants had significantly more time to conduct tax research than did those in prior studies 
(two hours per problem), they had limited time relative to the natural context. Future re- 
search could investigate the effects of time pressure on information search across planning 
and compliance contexts. The measure of performance in this study was average overall 
quality of the tax research memos. Future research could investigate the various components 
of tax research quality and the role of information search therein. Ambiguity, complexity, 
and justifiability demands were manipulated together using a natural manipulation in the 
tax setting—planning versus compliance context. Within context, however, the character- 
istics may vary with details of the client's situation or the technical tax law. Future research 
could manipulate these characteristics within the planning or compliance context to deter- 
mine if the relations between task and context features and adaptivity hold. I rely upon an 
indirect measure of technical knowledge in the supplemental analyses. Technical knowledge 
affects information search strategies (Cloyd 1995a, 1997; Spilker 1995), and the interaction 
of technical and institutional knowledge may yield information search strategies and forms 
of adaptivity not considered in the current study. Future research should consider the relative 
impacts of different knowledge types on information search strategies and tax research 
performance. Finally, planning and compliance contexts are also present in the financial 
accounting setting, but the relative risks and relevant features may differ therein. Future 
research could investigate the differences between tax and financial accounting planning 
and compliance contexts and the effect those differences would have on adaptivity. 
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In this study, I make three main contributions to the Hterature. First, I show that task, 
context, and individual characteristics combine to drive information search strategies in a 
professional setting and that a detailed analysis of the setting is necessary to predict when 
and how decision makers will adapt their decision strategies. Second, this study is the first 
to focus on the importance of institutional knowledge to decision-making process and per- 
formance in accounting. Finally, the results of this study contribute to tax practice. The 
results suggest that information search adaptivity is an important component of tax research 
performance (such adaptivity explained almost 30 percent of the variation in performance) 
and that adaptivity mediates the relation between institutional knowledge and performance. 
Universities and accounting firms can use this information to improve the education and 
training of tax professionals by providing opportunities for them to acquire institutional 
knowledge, including knowledge of both task and context features and appropriate adaptive 
strategies. 
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ABSTRACT: This study provides a theoretical framework and experimental evidence 
on how managers’ disclosure decisions affect their credibility with investors. | find that 
in the short-term, more forthcoming disclosure has a positive effect on management's 
reporting credibility, especially when management is-forthcoming about negative news. 
However, these short-term credibility effects do not persist over time. In the long-term, 
managers who report positive earnings news are rated as having higher reporting cred- 
ibility than managers who report negative earnings news, regardless of their previous 
disclosure decisions. 


Keywords: management credibility; reporting credibility; management disclosure; 
- affect. 


I. INTRODUCTION 
oth anecdotal and academic evidence suggest that management's reporting credibil- 
B ity matters. Articles in the popular press often detail managers' credibility struggles 
and subsequent attempts to regain reporting credibility. The academic literature 
confirms that managers’ credibility concerns are warranted—more credible managers are 
better able to communicate information to the capital markets (Williams 1996; Mercer 
2004). Despite the importance of this topic, there is little systematic evidence on the relation 
between the communications provided by management and the development of reporting 
credibility. My paper attempts to fill this void by providing a theoretical framework and 


! Recent examples include articles on AOL Time Warner management's attempt to "restore the company's cred- 
ibility with investors" (Cullen et aL 2003) and how Honeywell management is “in the credibility penalty box" 
(Barrett 2003). 
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experimental evidence regarding how managers” disclosure decisions affect their credibility 
with Investors. 

Specifically, the paper draws on two theories from psychology—attribution theory and 
affect-based reasoning theory—to develop a model of how the forthcomingness of man- 
agement's disclosures affects management's reporting credibility (i.e., management's per- 
ceived trustworthiness and competence in financial reporting). The model's basic proposi- 
tion is that management's reporting credibility will be based on different factors in the 
short- and long-term. In the short-term, reporting credibility will be determined by investors’ 
cognitive reactions (i.e., thoughts) about the forthcomingness of management's disclosures. 
That is, because investors prefer more forthcoming disclosure, greater forthcomingness will 
increase management’s reporting credibility. According to attribution theory, this effect will 
be especially strong for forthcomingness about negative news. 

In the long-term (i.e., as time passes subsequent to managers’ disclosure decisions), 
investors’ initial cognitions about these decisions will fade. However, affect-based reasoning 
theories suggest that investors’ affective reactions will remain in memory even after the 
related cognitions have faded (Damasio 1994; Kida and Smith 1995). Consequently, the 
model predicts that affective reactions to the news disclosed will determine management’s 
reporting credibility in the long-term. That is, managers who previously reported positive 
news will be perceived as having higher reporting credibility than managers who previously 
reported negative news, regardless of management’s level of forthcomingness about the 
news. 

In short, the model suggests that more forthcoming disclosures, especially those related 
to negative news, will result in short-term boosts to management’s reporting credibility. 
However, as time passes, investors will no longer reward managers for forthcoming disclo- 
sures. Instead, investors’ assessments of management’s reporting credibility will be based 
on the firm's financial performance (i.e., whether the firm reported positive or negative 
news). 

This paper also provides experimental tests of the validity of the proposed credibility 
model. Specifically, I present the results of an experiment in which 244 investors assess 
management’s reporting credibility in various scenarios. The experiment employs a 2 x 2 
x 2 between-subjects design that varies (1) management’s forthcomingness about unex- 
pected earnings news, (2) whether the news is positive or negative, and (3) whether man- 
agement’s reporting credibility is assessed immediately or two weeks after management’s 
disclosure decision. The experimental results are generally consistent with the model’s 
predictions as described above. 

This research is informative on both a theoretical and an applied level. Whereas a 
useful literature has developed pertaining to the consequences of reporting credibility (e.g., 
Williams 1996; Hirst et al. 1999), little theoretical work has been done on the determinants 
of reporting credibility. Given the significant role that management's reporting credibility 
plays in a firm's ability to communicate information to investors (Williams 1996; Mercer 
2004), developing an understanding of the factors that affect management's reporting cred- 
ibility is critical. The present study takes an important step in that direction by developing 
and testing the first theoretically grounded model of how managers' disclosure decisions 
affect their credibility with investors. This model should be of use to future accounting 
researchers studying the determinants of reporting credibility, as well as researchers in other 
fields interested in the development of credibility, such as management, marketing, political 
science, and sociology. | 

At an applied level, the results of this study should be informative to firm managers. 
When trying to build reporting credibility, managers routinely face decisions about the 
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extent to which they should disclose news to investors. My study's results provide guidance 
to managers regarding the credibility consequences of these decisions. Specifically, they 
suggest that there are short-term credibility benefits to more forthcoming disclosure, par- 
ticularly for negative news. This knowledge should be valuable, because reporting credi- 
bility is especially important to managers during certain periods of time, such as when they 
need to raise capital (Healy and Palepu 1995). However, my results also suggest that efforts 
to increase long-term reporting credibility via disclosure policies may be misplaced, as the 
credibility consequences of disclosure decisions are fleeting. 

The paper proceeds as follows. Section II draws on theories from cognitive and social 
psychology to derive a model of how the forthcomingness of management's disclosures 
affects its reporting credibility in the short- and long-term. Section III presents an experi- 
ment that tests the model's predictions, and Section IV reports the experiment's results. 
Section V provides summary remarks and discusses the implications of this research for 
managers and academics. 


H. MODEL DEVELOPMENT 
Overview 


I define management's reporting credibility as investors’ beliefs about management's 
trustworthiness and competence in financial disclosure (Hovland et al. 1953; Giffin 1967).? 
This section draws on several theories from psychology to develop a model of when and 
how management’s disclosure decisions affect its reporting credibility. The model suggests 
that after receiving company news and inferring management’s level of forthcomingness 
about the news, investors will experience both cognitive and affective reactions. Cognitive 
reactions take the form of thoughts (Markman and Gentner 2001) about the news and 
management’s forthcomingness, and affective reactions involve positive or negative eval- 
uative reactions (Slovic et al. 2002) to the news and management’s forthcomingness.? 

The model’s basic proposition is that the credibility consequences of managers’ disclo- 
sure decisions will differ in the short- and long-term. Specifically, the model predicts that 
in the short-term, investors’ cognitive reactions to management’s disclosure decisions de- 
termine the credibility consequences of those decisions. That is, because investors prefer 
more forthcoming disclosures, they will deem managers who provide such disclosures to 
be more credible than managers who do not. Moreover, the model predicts that investors 
will consider the reasons for management’s disclosure decisions and that these thoughts 
will influence the credibility effects of disclosure decisions. 

The model also predicts that as time passes, investors’ cognitive reactions to manage- 
ment’s disclosure decisions will fade. However, affective reactions make stronger imprints 
in memory than cognitive reactions (Damasio 1994; Kida and Smith 1995), so investors 
will retain their overall affective reaction to an event long after their cognitive reactions 
have faded. Thus, when investors assess management’s reporting credibility in the long- 
term, their initial overall affective reaction will form the basis for their assessments. Prior 
literature suggests that investors’ overall reactions will be driven primarily by the valence 
of the news disclosed (i.e., whether the news disclosed was positive or negative) rather than 


* The idea that management's reporting credibility represents investors’ beliefs about management is consistent 
with reputation studies in the economics literature. For example, Axelrod (1984) defines reputation as the 
embodiment of beliefs about the strategy that a particular person will use. 

> The term "affect" has been used to refer to various constructs, including evaluations, moods, and emotions 
(Fiske and Taylor 1991). Consistent with recent articles that examine the role of affect in decision making (e.g., 
Mellers et al. 1997; Finucane et al. 2000), I focus on affect as an evaluation (i.e., as a positive or negative 
emotional response associated with an event). 
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by whether management was forthcoming about the news (Kasznik and Lev 1995). Con- 
sequently, the model predicts that, in the long-term, management's reporting credibility will 
be a function of whether the firm reported positive or negative news, rather than manage- 
ment's forthcomingness about the news. These short- and long-term credibility effects are 
described in more detail below. 


Short-Term Credibility Effects 

The model's predictions regarding the short-term credibility effects of managers' dis- 
closure decisions center on investors' cognitions about management's forthcomingness. 
However, before describing these predictions, it is important to consider the factors that 
influence investors' perceptions of disclosure forthcomingness. Three facets of a firm's 
disclosures appear to be important predictors of perceived disclosure forthcomingness—the 
accuracy, completeness, and timeliness of a firm's disclosures. Disclosure accuracy is the 
most obvious factor in the perceived forthcomingness of management disclosures. Ceteris 
paribus, disclosures that are revealed to be inaccurate are more likely to be perceived as 
nonforthcoming than more accurate disclosures (e.g., Barrett 2003; Rama 2003). However, 
the completeness and timeliness of management's disclosures also influence perceptions of 
forthcomingness. For example, a company's annual report might report accurate overall 
financial results but not disclose enough detail for investors to determine the performance 
of certain critical business segments. Investors may regard such incomplete disclosures as 
nonforthcoming despite their accuracy (e.g., Silverman 2002; Steinberg 2003). Investors 
also may regard untimely disclosures as nonforthcoming. For example, firms that fail to 
provide earnings warnings are often perceived as less forthcoming than firms that provide 
such disclosures (e.g., Elgin 2003; Olver 2003). 

The few academic studies that consider the relation between disclosure forthcomingness 
and management’s reporting credibility focus on the short-term credibility effects of forth- 
coming disclosures (Libby and Tan 1999; Tan et al. 2002). Both Libby and Tan (1999) and 
Tan et al. (2002) ask investors to rate management's credibility immediately after receiving 
information about management's forthcomingness. These studies show that investors rate 
managers who provide forthcoming disclosures as more credible than managers whose 
disclosures are less forthcoming. Specifically, Libby and Tan (1999) find that managers who 
provide warnings about unexpected earnings are judged to have higher integrity than man- 
agers who do not provide such disclosures. Tan et al. (2002) manipulate the forthcomingness 
of management's disclosures by varying disclosure accuracy rather than disclosure timeli- 
ness and find similar results. That is, managers who accurately forecast earnings are rated 
as more competent and trustworthy than managers who significantly overstate or understate 
earnings. In short, these studies find that forthcoming managers have higher credibility than 
nonforthcoming managers, suggesting that disclosure forthcomingness affects manage- 
ment's reporting credibility, at least in the short-term. This leads to the following hypothesis: 


Hila: In the short-term, forthcoming disclosure will increase management's reporting 
credibility and a lack of forthcomingness will decrease management's reporting 
credibility. 


Attribution theory from social psychology provides a basis for refining H1a. Attribution 
theory describes how people impute the causes of others' behaviors (Fiske and Taylor 1991). 
A key tenet of attribution theory is that when situational explanations for an actor's behavior 
are not apparent, people are more likely to attribute the behavior to an enduring dispositional 
trait of the actor (Jones and Davis 1965). When situational forces are apparent, people are 
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less likely to make a dispositional attribution. In the management disclosure context, attri- 
bution theory predicts that disclosures that are at odds with managers' personal incentives 
are more likely to be attributed to a dispositional characteristic (e.g., the manager's honesty 
or trustworthiness) than disclosures that are consistent with the managers' incentives. In 
other words, because managers tend to have fewer personal incentives to disclose negative 
news than positive news (Dimma 1996; Hughes 1997), investors are more likely to attribute 
forthcoming negative news disclosures to management's honesty and trustworthiness than 
forthcoming positive news disclosures. 

Research in attribution theory also suggests that investors will spend more time thinking 
about negative outcomes (Weiner 1985), and thus are more likely to think about the reasons 
for negative news disclosure decisions. Consequently, investors are more likely to think 
about management's role in negative news disclosure decisions than positive news disclo- 
sure decisions (Ybarra 2002). Such additional thought 1s likely to (1) heighten the favorable 
dispositional attributions investors may already be inclined to make regarding managers 
who are forthcoming about negative news, and (2) heighten the unfavorable dispositional 
attributions investors may be inclined to make regarding managers who are not forthcoming 
about negative news. With these principles in mind, I expect that managers will receive 
more credit for forthcoming disclosure and more blame for nonforthcoming disclosure when 
the disclosure pertains to negative news. This reasoning leads to Hypothesis 1b: 


Hib: In the short-term, forthcoming disclosure of negative news will increase manage- 
ment's reporting credibility more than forthcoming disclosure of positive news. 
A lack of forthcomingness about negative news will decrease management's re- 
porting credibility more than a lack of forthcomingness about positive news. 


A graphical summary of the short-term credibility predictions is provided in Panel A 
of Figure 1. 


Long-Term Credibility Effects 


Central to my model is the claim that the short-term credibility effects predicted in Hla 
and H1b——effects that are driven by investors’ cognitive reactions to management’s disclo- 
sure decisions—may not hold over time. Rather, in the long-term, perceptions of manage- 
ment's reporting credibility will be driven primarily by investors' affective reactions to the 
news disclosed. In making this prediction, I draw on an affect-based model of financial 
decision-making proposed by Kida and Smith (1995). 

Kida and Smith (1995) argue that when investors experience an event, both the event 
and the overall affective reaction to that event are encoded in memory. Further, they posit 
that affective reactions create stronger imprints in memory than the specific events under- 
lying those reactions. As time passes subsequent to a disclosure decision, investors' mem- 
ories of the detailed information underlying their credibility assessments will fade. However, 
Kida and Smith's (1995) research suggests that investors' affective reactions will be acces- 
sible long after the specific details underlying them have faded. Consequently, when inves- 
tors assess management's reporting credibility in the long-term, they will reconstruct the 
details of managers' disclosure decisions to be consistent with their initial overall affective 
reaction (Clore et al. 1994; Kida and Smith 1995). Thus, investors' long-term credibility 
assessments will be driven by their initial overall affective reaction. 

This argument suggests that the long-term credibility consequences of managers' dis- 
closure decisions will depend on the source of investors' overall affective reactions. After 
receiving news and inferring the management's forthcomingness about the news, investors 
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FIGURE 1 
Predicted Changes m Management’s Reporting Credibility as a Function of News Valence 
and Management’s Forthcomingness 


Panel A: Short-Term Predictions 
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Panel B: Long-Term Predictions 
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Figure ] summarizes the predicted short-term (Panel A) and long-term (Panel B) change in management's 
reporting credibility as a function of news valence and management's forthcomingness. Values above (below) the 
dotted line represent predicted increases (decreases) in management’s reporting credibility. 





will experience affective reactions to (1) the valence of the news itself (positive or negative), 
and (2) management's forthcomingness about the news (forthcoming or not forthcoming). 
Positive news will result in a positive affective reaction, while negative news will result in 
a negative affective reaction. Similarly, forthcoming disclosure will result in positive affect, 
and a lack of forthcomingness will result in negative affect. In some cases, these affective 
reactions will be in conflict, such as when there is negative news (which results in negative 
affect) but management is forthcoming about the news (which results in positive affect). 
Investors' overall affective reaction to these events will depend on which of these two 
affective reactions is stronger (Damasio 1994). A paper by Kasznik and Lev (1995) provides 
reason to believe that investors' affective reactions to news valence will be stronger than 
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their affective reactions to management's forthcomingness.* Specifically, Kasznik and Lev 
(1995) find that earnings news is a stronger predictor of stock market reactions than whether 
management was forthcoming about the news (i.e., whether management provides an earn- 
ings warning). Consequently, I expect that positive (negative) news will result in positive 
(negative) overall affect being encoded in memory, regardless of management's forthcom- 
ingness about the news. 

If investors' overall affective reactions are the primary driver of their long-term credi- 
bility judgments and if news valence determines these affective reactions, then the long- 
term credibility consequences of managers' disclosure decisions will depend on the valence 
of the news disclosed. As a result, more forthcoming disclosures will not necessarily result 
in greater reporting credibility in the long-term. Rather, managers of firms that report pos- 
itive news will have greater reporting credibility than managers of firms that report negative 
news, regardless of the managers' forthcomingness. 

To summarize, as time passes subsequent to managers' disclosure decisions, investors' 
cognitions about these decisions will fade. Consequently, investors who assess manage- 
ment's reporting credibility at a later date will base their assessments on their initial overall 
affective reaction rather than these cognitions. The model also predicts that an investor's 
overall affective reaction will be driven by the valence of the news (positive or negative) 
rather than management's forthcomingness about the news. As a result, in the long-term, 
investors’ perceptions of management's reporting credibility will be based on whether the 
firm previously reported positive or negative news. That is, managers who reported positive 
news will be rewarded with increased credibility and managers who reported negative news 
will be penalized with decreased credibility. In sharp contrast to the model's short-term 
predictions, management's forthcomingness is not expected to be the primary driver of its 
reporting credibility in the long-term. This reasoning is summarized in Hypothesis 2 and 
can be seen graphically by comparing Panels A and B of Figure 1. 


H2: In the long-term, managers who previously reported positive news will experience 
increases in credibility and managers who previously reported negative news will 
experience decreases in credibility, regardless of their perceived forthcomingness. 


IIL EXPERIMENT 
Participants 


To test my hypotheses, I conducted an experiment with 244 M.B.A. students at a large 
state university. Participants were recruited via an email notice and were paid a flat wage 
of $10 plus the chance to win additional money in a random lottery. On average, study 
participants had six years of work experience and had completed three accounting and two 
finance classes; most (83 percent) had previously invested in individual common stocks. T 
used M.B.A. students as participants in this study because their business experience, in- 
vesting experience, and coursework should provide them with a basis for understanding 
managers' disclosure incentives (Elliott et al. 2004). Previous research indicates that some 
understanding of the incentives managers have to disclose or withhold information is nec- 
essary for the presentation of the hypothesized short-term credibility effects (Robertson and 
Rossiter 1974). For example, if participants do not understand that managers have incentives 


* If this prediction is incorrect, and affective reactions to management's forthcomingness are the primary deter- 
minant of investors' overall affective reactions, then H2 will not hold. In this case, management's forthcoming- 
ness, rather than news valence, will drive reporting credibility in the long-term. 
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to withhold Information, then they will not reward forthcoming managers with short-term 
increases in reporting credibility. 


Experimenta! Design and Task 

The experiment uses a 2 X 2 X 2 between-subjects design that varies the following 
three variables: management's forthcomingness about unexpected earnings news (Manage- 
ment Forthcomingness: Low, High), the valence of the earnings news (News Valence: Pos- 
itive, Negative), and the amount of time between reception of the earnings news and par- 
ticipants' assessments of management's reporting credibility (Time: Short-term, Long-term). 
An experimental approach to investigations of the credibility effects of disclosure forth- 
comingness has several advantages. First, using an experiment allows me to hold constant 
factors other than management’s forthcomingness. Such control is important when studying 
the determinants of reporting credibility because firms with highly credible managers are 
fundamentally different in terms of earnings and returns than those with less credible man- 
agers (Lang and Lundholm 1993). Second, experimentation allows for the collection of data 
on the credibility effects of specific disclosure decisions. Archival researchers can use fi- 
nancial analysts' ratings of a firm's disclosures (AIMR reports) to measure managers' re- 
porting credibility. However, AIMR reports are annual ratings, and the time lag between a 
particular management disclosure and the subsequent AIMR report makes it difficult to use 
these reports to infer the credibility effects of specific disclosures. 

Experimental participants assumed the role of investment club members who had re- 
cently purchased common stock in Dentex, a dental supply company. Each participant was 
given background information about Dentex, inclucing excerpts from management's dis- 
cussion and analysis and historical financial statement data. Participants also were informed 
that the consensus analyst forecast for Dentex's upcoming quarterly earnings was $0.52. 
After reviewing this information, participants provided pre-test assessments of manage- 
ment's reporting credibility. 

Participants in the High Management Forthcomingness condition then received a vol- 
untary disclosure from management stating that actual quarterly earnings were expected to 
differ from the consensus forecast. Specifically, participants in the Positive (Negative) News 
Valence condition received the following managemeat disclosure. 


On March 15, Dentex management voluntarily issued the following press release: 

In a presentation to industry analysts today, Dentex, Inc. CEO Murray Levine 
indicated that the company currently expects earnings per share will be $0.60 ($0.44) 
for the quarter ending March 31. This estimate is $0.08 above (below) the current 
consensus analyst forecast of $0.52.° 


Participants in the Low Management Forthcomingness condition did not receive this 
disclosure. 

Subsequently, all participants answered a number of demographic questions regarding 
their investing experience, work experience, and prior coursework. After providing these 
assessments, participants received information about the firm's actual quarterly earnings. 


* Brown (2001) finds that approximately 75 percent of profit firms and 43 percent of loss firms report earnings 
that are less than $0.05 from the consensus analyst forecast. Thus, the amount of unexpected earnings in this 
study ($0.08) is larger than average. Relatively large unexpected earnings should enhance differential perceptions 
of management's forthcomingness across the High Forthcomingness and Low Forthcomingness conditions (Tan 
et al. 2002; Libby et al. 2002, 798). Smaller unexpected earnings may yield smaller effect sizes but should show 
the same overall pattern of responses. 
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Actual earnings were $0.60 in the Positive News Valence condition and $0.44 in the Neg- 
ative News Valence condition. Participants then answered several manipulation check and 
affective reaction questions. 

Participants in the Short-term condition answered additional questions immediately af- 
ter viewing actual earnings. Specifically, they re-assessed management's reporting credibil- 
ity, answered questions about their cognitive processing, and indicated their willingness to 
rely on a subsequent forecast issued by management. Long-term participants answered these 
same questions two weeks after completing the experimental materials. I chose a two-week 
time delay because two weeks is a sufficient amount of time for memory reconstruction to 
commence (Spiro 1980; Kida et al. 1998), but not so long that it threatens the participant 
response rate.° 


Dependent Variables 
Management's Reporting Credibility 

I drew six questions from two widely accepted source credibility scales—McCroskey 
(1966) and Leathers (1992)—to measure management’s reporting credibility (see Appen- 
dix). Three questions related to management's competence in financial reporting. These 
questions asked investors to assess management's competence, knowledge, and qualifica- 
tions for providing financial disclosures. Three additional questions related to management's 
trustworthiness in financial reporting. These questions focused on investors' perceptions of 
management's trustworthiness, honesty, and truthfulness in its financial disclosures. 

To ensure that these questions capture one underlying credibility construct, I conducted 
a reliability analysis on participants’ responses. The results of this analysis yielded a 
Cronbach's alpha of 0.78, suggesting that the scale is reliable (Nunnally 1978, 245). Con- 
sequently, I formed a composite measure of management's reporting credibility by summing 
participants’ responses to the six questions.’ Because the hypotheses are framed in terms 
of changes in management's reporting credibility, I then calculated a credibility difference 
score for each participant (i.e., post-test composite reporting credibility minus pre-test com- 
posite reporting credibility). 

As noted previously, prior research has demonstrated the important consequences of 
management's reporting credibility. These studies show that investors are more likely to 
rely on the disclosures of highly credible managers, making it easier for these managers to 
communicate information to the capital markets (Williams 1996; Hirst et al. 1999). The 
present paper examines the determinants of management's reporting credibility, with an 
emphasis on how the forthcomingness of management disclosures affects reporting credi- 
bility. Consequently, my primary dependent variable is changes in management's reporting 
credibility. However, to ensure that participants' ratings of reporting credibility have sig- 
nificant consequences, I also examine whetber changes in perceived reporting credibility 
affect participants” willingness to rely on subsequent management disclosures. Specifically, 
participants were shown a management earnings forecast for a subsequent quarter and asked 
to express their willingness to rely on this forecast when predicting earnings. À positive 
news earnings forecast was chosen for this task because prior research shows that reporting 


6 Although I only use Long-term participants’ two-week-later responses, I asked all participants to return after 
two weeks to prevent them from guessing the manipulations. To motivate participants to return, those who 
completed both parts of the study were entered into a $1500 lottery ($1000 first prize and $500 second prize). 
Approximately 89 percent of participants returned after two weeks. 

? Inferentially identical results appear when management's reporting credibility is measured with credibility factor 
scores derived by conducting a factor analysis on participants' responses to the six credibility questions. 
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credibility is especially important for increasing the believability of positive news disclo- 
sures (Williams 1996). 


Process Measures 

My model also identifies the mechanisms through which managers' disclosure decisions 
affect reporting credibility. The model proposes that the short-term credibility consequences 
of managers' disclosure decisions are driven primarily by investors' cognitive reactions—in 
particular, the attributions they make for managers' disclosure decisions. To measure par- 
ticipants' attributions, I provided a list of six potential explanations for management's dis- 
closure decision: pressure from analysts and investors, self-interest, management's honesty, 
predictability of the firm's earnings, a desire for the firm's stock price to be consistent with 
economic reality, and legal liability concerns. Participants ranked these explanations from 
] to 6, where 1 (6) was the factor that they believed was the most (least) important cause 
of management's disclosure decision. Because my theoretical framework suggests that dis- 
positional attributions lead to changes in management's perceived credibility, I am primarily 
interested in attributions to “‘management’s honesty/dishonesty." I use the relative rank of 
this variable as my primary measure of investors' cognitive reactions. 

The model predicts that investors' long-term credibility judgments are driven by their 
affective reactions to the news valence, rather than their cognitive reactions to manage- 
ment's disclosure decisions. Consistent with prior research on the role of affect in decision- 
making (Mellers et al. 1997; Kida et al. 1998), I measured participants” affective reactions 
to the news valence by asking them to indicate their agreement with the following statement, 
"The unexpected earnings caused me to feel good." Participants indicated their agreement 
on a seven-point Likert-type scale, with endpoints 1 = Strongly Disagree and 7 = Strongly 
Agree. Responses to this question capture both the direction and strength of participants' 
affect. That is, responses above (below) the scale midpoint indicate positive (negative) 
affect; the higher (lower) the number, the more positive (negative) the affective reaction." 


IV. RESULTS 
Manipulation Checks 


Responses to the manipulation check questions indicate that all manipulations were 
successful. Among participants in the High (Low) Management Forthcomingness condition, 
98 percent (98 percent) correctly indicated that they did (did not) receive a warning about 
the company's unexpected earnings. Among participants in the Positive (Negative) News 
Valence condition, 100 percent (99 percent) correctly indicated the direction of the unex- 
pected earnings news. Excluding participants who failed one or more of the manipulation 
checks does not materially change any of the results reported below. 


Effects of Management’s Disclosure Decisions on Its Reporting Credibility 

The proposed model makes a number of specific predictions about the short- and long- 
term credibility effects of managers’ disclosure decisions. Underlying these predictions is 
the assumption that the credibility consequences of disclosure decisions differ in the short- 
and long-term. Thus, before testing the model’s specific predictions, I test this general 


o 


Participants also evaluated the statement, “The unexpected earnings caused me to feel bad" on a similar Likert- 
type scale (1 = Strongly Disagree and 7 = Strongly Agree). Participants’ responses to this “feel bad" question 
are inversely related to their responses to the “feel good" question, suggesting that the “feel good" question is 
able to capture both positive and negative affect. 
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assumption. Specifically, I conduct a three-way ANOVA with Management Forthcoming- 
ness, News Valence, and Time as independent variables and participants' credibility differ- 
ence scores as the dependent variable. As predicted, the ANOVA reveals a significant three- 
way interaction term (F — 4.14, two-tailed p — 0.04). This result suggests that the credibility 
effects of more forthcoming disclosure differ in the short- and long-term. I discuss these 
short- and long-term effects in more detail in the following two sections. 


Short-Term Credibility Effects | 

Recall that the model offers two hypotheses regarding the short-term credibility effects 
of management's disclosure decisions: Hypothesis 1a predicts that forthcoming disclosures 
will increase management’s reporting credibility and a failure to provide forthcoming dis- 
closures will decrease management's reporting credibility. Hypothesis 1b predicts that these 
effects will be more pronounced for negative news than for positive news. To test these 
hypotheses, I first conduct an omnibus two-way ANOVA with Management Forthcoming- 
ness and News Valence as independent variables and Short-term participants' credibility 
difference scores as the dependent variable (see Table 1). The ANOVA shows a statistically 
significant main effect for Management Forthcomingness (F — 52.08, two-tailed p « 0.01) 
and a statistically significant Management Forthcomingness x News Valence interaction 
(F = 5.17, two-tailed p = 0.03). These results suggest that decisions about whether to 
provide forthcoming disclosures influence management's reporting credibility and that the 
magnitude of this influence differs for positive and negative news. 

If forthcoming disclosure increases reporting credibility and failure to provide forth- 
coming disclosure decreases reporting credibility as proposed in Hla, then credibility dif- 
ference scores for participants in the High (Low) Management Forthcomingness conditions 
should be positive (negative). In support of Hla, credibility difference scores in the High 
Management Forthcomingness conditions are greater than zero (mean = 2.63, t = 6.07, 
one-tailed p « 0.01), and credibility difference scores in the Low Management Forthcom- 
ingness conditions are less than zero (mean — —2.73, t — 4.43, one-tailed p « 0.01). These 
relations hold across both Positive and Negative levels of News Valence. That is, credibility 
difference scores in the High Management Forthcomingness condition are greater than zero 
for both Positive News (mean = 2.15, t = 3.76, one-tailed p < 0.01) and Negative News 
(mean = 3.13, t = 4.82, one-tailed p < 0.01), whereas credibility difference scores in the 
Low Management Forthcomingness condition are less than zero for both Positive News 
(mean = —1.52, t = 2.77, one-tailed p < 0.01) and Negative News (mean = —3.94, t 
— 3.68, one-tailed p « 0.01). 

Hypothesis 1b predicts that the effects of management's forthcomingness on its re- 
porting credibility will be more pronounced for negative news. The data provide partial 
support for H1b. Participants’ credibility difference scores decreased more in the Low Man- 
agement Forthcomingness/Negative News condition (mean = -—3.94) than in the Low 
Management Forthcomingness/Positive News condition (mean = —1.52) (t = 2.01, one- 
tailed p = 0.03).? Further, participants” credibility differences scores increased more in the 
High Management Forthcomingness/Negative News condition (mean = 3.13) than in 


? Tables 1 and 2 show unequal variances across the experimental conditions. I therefore use t-tests that do not 
assume equal variances when analyzing differences across conditions. 
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TABLE 1 
Short-Term Effects of News Valence and Management's Forthcomingness on Management's 
Reporting Credibility 
Panel A: Mean (Standard Deviation) Changes in Management's Reporting Credibility 
Positive Negative 
News News 
Low Management — 1.52 —3.94 — 2.73 
Forthcomingness (3.11) (6.06) (4.94) 
l n = 32 n = 32 n = 64 
High Management 2.15 3.13 2.63 
Forthcomingness (3.24) (3.56) (3.41) 
n = 32 n = 30 n = 62 
0.32 —0.52 
(3.66) (6.12) 
= 64 n = 62 


Panel B: ANOVA Results f 
df MSE F-statistic p-value 


Management Forthcomingness 1 911.28 52.08 .00 
News Valence 1 16.43 0.94 .34 
Management Forthcomingness X News Valence 1 90.577 5.17 .03 


Table 1 shows the short-term effects of news valence and management's forthcomingness on management's 
reporting credibility. Prior to viewing the experimental manipulations, participants provided pre-test assessments 
of management's reporting credibility by answering six questions (see Appendix). À composite measure of 
management's reporting credibility was computed by summing participants' responses to these questions. 
Participants in the High Management Forthcomingness condition were then provided with a voluntary disclosure 
from management stating that actual quarterly earnings were expected to differ from the consensus analyst 
forecast. Specifically, participants in the Positive (Negative) News condition were told that earnings per share 
was expected to be $0.08 above (below) the consensus analyst forecast. Participants in the Low Management 
Forthcomingness condition did not receive this disclosure. Subsequently, all participants were provided with the 
firm's actual quarterly earnings, which were $0.08 above (below) the consensus forecast for Positive (Negative) 
News participants. Immediately after viewing these materials, participants provided post-test assessments of 
management's reporting credibility by answering the same credibility questions used in the pre-test. Panel A 
reports the change in management's reporting credibility (computed as post-test composite reporting credibility 
minus pre-test composite reporting credibility) by experimental condition. Panel B reports an ANOVA that tests 
the short-term effects of news valence and management's forthcomingness on changes in management's 
reporting credibility. 


the High Management Forthcomingness/Positive News condition (mean — 2.15), though 
this difference did not reach statistical significance (t = 1.12, one-tailed p = 0.13).’° 

To summarize, I find that in the short-term, management's decisions about whether to 
provide forthcoming disclosures affect its reporting credibility. Forthcoming disclosure re- 
sults in increased reporting credibility and a lack of forthcoming disclosure results in de- 
creased reporting credibility. Negative news disclosure decisions appear to be especially 


© [f participants were more surprised by the negative earnings news than the positive earnings news, then it is 
possible that the expeciedness of the news, rather than the news valence, per se, drove their responses. To test 
for this possibility, I asked participants to evaluate the expectedness of the earnings information. The results do 
not indicate that participants in the Negative News condition were significantly more surprised by the unexpected 
earnings than participants in the Positive News condition (t = 0.47, two-tailed p = 0.64). Therefore, the ex- 
pectedness of the news cannot explain the observed effects of news valence. 
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important in determining reporting credibility—managers who are not forthcoming about 
negative news experience greater credibility losses than managers who are not forthcoming 
about positive news. A graphical summary of these effects appears in Panel A of Figure 2. 


Long-Term Credibility Effects 

The model also predicts that the credibility consequences of disclosure forthcomingness 
will be short-lived. Specifically, the model predicts that news valence (i.e., whether the firm 
reports positive or negative news), rather than management's forthcomingness, will drive 





FIGURE 2 
Changes in Management's Reporting Credibility as a Function of News Valence and 
Management’s Forthcomingness 


Panel A: Changes in Management's Reporting Credibility in the Short-Term 
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Panel B: Changes in Management's Reporting Credibility in the Long-Term 
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Figure 2 reports the mean short-term (Panel A) and long-term (Panel B) change in management’s reporting 
credibility by experimental condition. Prior to exposure to the experimental manipulations (see Table 1 for 
descriptions), participants provided pre-test assessments of management’s reporting credibility by answering six 
credibility questions (see Appendix). A composite measure of management’s reporting credibility was computed 
by summing each participant’s responses to these questions. Participants in the Short-term conditions answered 
the questions a second time immediately following exposure to the experimental manipulations, while 
participants in the Long-term conditions provided post-test assessments two weeks after exposure to the 
experimental manipulations. Credibility difference scores were computed by subtracting participants’ pre-test 
composite credibility assessments from their post-test composite credibility assessments. Positive (negative) 
credibility difference scores indicate increases (decreases) in management’s reporting credibility. 
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management's reporting credibility in the long-term. Consequently, H2 predicts that man- 
agers who report positive news will experience long-term credibility gains and managers 
who report negative news will experience long-term credibility losses, regardless of their 
forthcomingness about the news. In statistical terms, this suggests that a two-way ANOVA 
on Long-term participants' credibility difference scores will yield a statistically significant 
main effect for News Valence only. The data support this prediction (see Table 2). The 
ANOVA yields a significant main effect for News Valence (F = 16.77, two-tailed p < 0.01) 
and insignificant effects for Management Forthcomingness (F = 0.07, two-tailed p = 0.79) 
and the Management Forthcomingness x News Valence interaction term (F = 0.57, two- 
talled p = 0.45). These results suggest that News Valence is the primary predictor of 
participants' beliefs about management's reporting credibility in the long-term. 

If, as suggested in H2, reporting positive news leads to long-term credibility increases 
and reporting negative news leads to long-term credibility decreases, then the credibility 
difference scores for participants in the Positive (Negative) News conditions should be 
positive (negative). The data support H2 across both High and Low levels of Management 
Forthcomingness. In the long-term, Positive News participants” credibility difference scores 
are greater than zero, regardless of whether management was forthcoming (mean — 2.44, 
t — 3.11, one-tailed p « 0.01) or not forthcoming (mean — 1.49, t — 2.89, one-tailed p 
< 0.01) about the positive news. This result suggests that positive news has positive effects 


TABLE 2 
Long-Term Effects of News Valence and Management’s Forthcomingness on Management's 
Reporting Credibility 
Panel A: Mean (Standard Deviation) Changes in Management’s Reporting Credibility 
Positive Negative 
News News 
Low Management 1.49 1.61 —0.21 
Forthcomingness (2.53) (4.89) (4.26) 
n = 24 n = 29 n = 53 
High Management 244 —2.06 0.27 
Forthcomingness (4.16) (6.57) (5.86) 
n = 28 n = 26 n= 4 
2.00 —1.83 
(3.50) (5.70) 
n = 52 n = 55 
Panel B: ANOVA Results 
df MSE F-statistic p-value 
Management Forthcomingness 1 1.69 0.07 .79 
News Valence 1] 385.23 16.77 .00 
Management Forthcomingness x News Valence i 13.02 0.57 AS 


Table 2 presents the long-term effects of news valence and management’s forthcomingness on management’s 
reporting credibility. Participants in the Long-term conditions made post-test assessments of management’s 
reporting credibility two weeks after viewing the experimental manipulations. See Table 1 for descriptions of the 
manipulations. Panel A reports the change in management's reporting credibility (computed as post-test 
composite reporting credibility minus pre-test composite reporting credibility) by experimental condition. Panel 
B reports an ANOVA that tests the long-term effects of news valence and management’s forthcomingness on 
changes in management’s reporting credibility. 
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on management's reporting credibility—even when management is not forthcoming about 
the news. Perhaps more importantly, the credibility difference scores of participants in the 
Positive News/High Management Forthcomingness condition are not significantly greater 
than those in the Positive News/Low Management Forthcomingness condition (t = 1.01, 
two-tailed p — 0.32). This result suggests that there are few, if any, long-term credibility 
consequences to withholding positive news. 

Negative News participants report credibility difference scores that are less than zero 
both when management is forthcoming (mean — —2.06, t — 1.60, one-tailed p — 0.06) and 
when management is not forthcoming (mean = —1.61, t = 1.78, one-tailed p = 0.04). In 
other words, managers who ultimately report negative news lose reporting credibility, 
regardless of their forthcomingness. In addition, the credibility difference scores of par- 
ticipants in the Negative News/High Management Forthcomingness condition are not sig- 
nificantly greater than those of participants in the Negative News/Low Management Forth- 
comingness condition (t — 0.28, two-tailed p — 0.78). Thus, it appears that there are no 
long-term credibility benefits to being forthcoming about negative news. 

In sum, I find that in the long-term, management's reporting credibility is unrelated to 
the forthcomingness of its disclosures. Instead, management's reporting credibility is driven 
by the valence of the news that is ultimately reported. Specifically, managers who report 
negative news experience decreased reporting credibility, and managers who report positive 
news experience increased reporting credibility, regardless of their forthcomingness about 
the news. Panel B of Figure 2 summarizes these long-term credibility effects. 


Understanding the Process 
Overview 

The model not only predicts that the credibility consequences of managers' disclosure 
decisions will differ in the short- and long-term, but it also identifies factors that may 
explain why these effects occur. Specifically, the model posits that investors' cognitive 
reactions to disclosure decisions will drive their perceptions of management's reporting 
credibility in the short-term, but that investors' affective reactions to the news valence will 
drive their perceptions of management's reporting credibility in the long-term. I test these 
hypotheses about the determinants of management's reporting credibility below. In addition, 
I explore the potential consequences of changes in management's reporting credibility. 
Specifically, I examine whether changes in management's reporting credibility are reflected 
in investors’ willingness to rely on management's subsequent disclosures. 

I use the AMOS path analysis program (Arbuckle 1999) on data collected in my ex- 
periment to simultaneously test these determinants and consequences. Path analysis extends 
the regression model by allowing for simultaneous examination of both direct and indirect 
effects among a set of variables (Kline 1998). This technique is used to assess the overall 
fit of a causal model as well as the significance of specific links within the model. In the 
present context, path analysis provides a parsimonious and powerful method for identifying 
the roles that investors' cognitive and affective reactions play in their beliefs about man- 
agement's reporting credibility in the short- and long-term, and whether changes in 
management's reporting credibility, in turn, affect investors’ willingness to rely on man- 
agement's subsequent disclosures. 


Determinants of Reporting Credibility 

The model predicts that investors will attribute forthcoming management disclosures to 
management's dispositions, and that these cognitions will lead to short-term increases in 
management’s reporting credibility. However, because cognitions fade from memory over 
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time, the model offers a different prediction about the determinants of management’s re- 
porting credibility in the long-term. Investors' affective reactions, which are driven largely 
by news valence, make stronger imprints in memory than their cognitive reactions. Con- 
sequently, the model also predicts that investors' affective reactions to the company's 
news—rather than their cognitive reactions to management’s forthcomingness about the 
news—will drive investors' beliefs about management's reporting credibility in the long- 
term. 

I test these predictions by conducting two path analyses (see Figure 3). These analyses 
test the effects of management's forthcomingness about positive and negative news on 
participants' cognitive and affective reactions, and the effects of these reactions, in turn, on 
participants” credibility difference scores. The first path analysis examines Short-term par- 
ticipants’ responses (Panel A), and the second path analysis examines Long-term partici- 
pants' responses (Panel B). 

The data appear to fit the model well. Comparative fit indices (CFIs) greater than 0.90 
suggest good model fit (Kline 1998, 131), and the Short- and Long-term path models have 
CFIs of 0.99 and 0.98, respectively. Further, nested models confirm that the overall path is 
different in the Short- and Long-term (x? = 5.78, two-tailed p = 0.06). 

Results for the individual model links are also as predicted. Panel À of Figure 3 shows 
a statistically significant link between Short-term participants' cognitive reactions and their 
credibility difference scores (B — 0.25, two-tailed p « 0.01). Thus, as expected, cognitive 
reactions are a significant predictor of management’s reporting credibility in the short-term. 
Also as expected, the link between Short-term participants' affective reactions and their 
credibility difference scores is not significant (B = 0.12, two-tailed p > 0.10). This result 
suggests that affective reactions are not a significant predictor of management's reporting 
credibility in the short-term. 

The model predicts that affective reactions will be the primary determinant of partici- 
_ pants’ credibility assessments in the long-term. In support of this hypothesis, Panel B of 
Figure 3 shows a statistically significant link between Long-term participants' affective 
reactions and their credibility difference scores (B = 0.27, two-tailed p < 0.01). The link 
between Long-term participants” cognitive reactions and their credibility difference scores 
is not statistically significant (B = —0.08, two-tailed p > 0.10). These results indicate that 
affective reactions—rather than cognitive reactions—are the primary driver of Long-term 
participants' beliefs about management's reporting credibility. 

Taken together, the path analyses provide strong support for the model. Investors' cog- 
nitive reactions to management’s disclosure decisions influence investors’ beliefs about 
management's reporting credibility in the short-term, but not in the long-term. In contrast, 
investors' affective reactions influence their beliefs about management's reporting credibility 
in the long-term, but play little or no role in the short-term. Thus, the path analyses support 
the model's contention that managers' disclosure decisions have different credibility con- 
sequences in the short- and long-term because investors' credibility assessments are in- 
formed by different factors over time. 


Consequences of Reporting Credibility 

The path analyses also examine the consequences of gains or losses to management's 
reporting credibility. Prior studies suggest that management's reporting credibility influences 
its ability to communicate information to investors. Specifically, investors are more likely 
to rely on disclosures from highly credible managers (Williams 1996; Hirst et al. 1999). 
To examine whether the credibility effects demonstrated in this study are consequential, all 
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FIGURE 3 
Path Analysis Results—Determinants and Consequences of Management's 
Reporting Credibility 


Panel A: Determinants and Consequences of Reporting Credibility in the Short-Term 


0.36*** 





Panel B: Determinants and Consequences of Reporting Credibility in the Long-Term 


Willingness 
to Rely on 


Subsequent 


Disclosure 





*** Indicates p « 0.01 two-tailed. 

NS Indicates p 0.10 two-tailed. 

Figure 3 reports the standardized regression coefficients for two path analyses examining (1) the determinants of 
management's reporting credibility i in the short- and long-term, and (2? whether changes in management’s 
reporting credibility affect investors' willingness to rely on management's subsequent disclosures. Participants' 
cognitive reactions were measured by asking them to indicate the degree to which they attributed management's 
disclosure decision to management's honesty. Participants' affective reactions were measured by asking 
participants to assess the direction and strength of their feelings related to the news valence. The change in 
management's reporting credibility was calculated by subtracting participants’ pre-test composite credibility 
assessments from their post-test composite credibility assessments (see Appendix). Willingness to rely was 
measured by providing participants with an additional management disclosure following exposure to the 
experimental manipulations and asking them to indicate their willingness to rely on that disclosure. _ 





participants viewed an additional management disclosure after experiencing the experimen- 
tal manipulations and indicated their willingness to rely on the disclosure. Panels A and B 
in Figure 3 show that the link between participants' credibility difference scores and their 
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willingness to rely on management's subsequent disclosure is statistically significant for 
both Short-term (B = 0.24, two-tailed p < 0.01) and Long-term (B = 0.28, two-tailed 
p < 0.01) participants. These results suggest that gains and losses in management's re- 
porting credibility indeed influence their ability to communicate persuasively with investors. 


Y. CONCLUSIONS 

In summary, the results suggest that the factors that affect management's reporting 
credibility differ depending on whether credibility is assessed in the short-term or the long- 
term. In the short-term, management's level of forthcomingness affects its reporting cred- 
ibility. That is, forthcoming disclosure temporarily increases reporting credibility and a lack 
of forthcomingness temporarily decreases reporting credibility. Moreover, negative news 
disclosure decisions appear to be especially important in determining management's re- 
porting credibility in the short-term. However, the results also suggest that these credibility 
effects are fleeting. In the long-term, management's reporting credibility is primarily a 
function of the valence of news disclosed. That is, managers of firms who report positive 
news are rated as having higher reporting credibility than managers who report negative 
news, regardless of the managers' forthcomingness about the news. The paper draws on 
attribution and affect-based reasoning theories from psychology to offer a theoretical frame- 
work that predicts and explains each of these results. 

The results extend the existing accounting literature in several ways. First, they further 
our understanding of the determinants of reporting credibility. Despite the fact that man- 
agement's reporting credibility has a significant impact on management's ability to com- 
municate with investors, we know surprisingly little about how managers develop credibil- 
ity. Previous studies (Libby and Tan 1999; 'Tan et al. 2002) show that managers who provide 
forthcoming disclosures are more credible than managers who are not forthcoming. How- 
ever, reporting credibility is not the primary focus in these studies, and consequently the 
authors do not offer a theory for how and when forthcomingness affects reporting credi- 
bility. The present paper extends this early work by developing the first theory-based model 
of how managers' disclosure decisions influence their reporting credibility. Importantly, the 
model suggests that the credibility effects demonstrated in prior studies only hold in the 
short-term. Tests of the model confirm this prediction—there appear to be few, if any, long- 
term credibility benefits to more forthcoming disclosure. 

Second, the results may have implications for researchers who use AIMR reports to 
measure disclosure quality. AIMR reports are based on financial analysts' annual ratings of 
firms’ disclosure policies. These ratings are typically made a significant time after the firms’ 
disclosure decisions. Consequently, the model suggests that AIMR ratings will be based on 
analysts' affective reactions to the news disclosed instead of their cognitive reactions to the 
disclosure decisions. That is, the ratings likely reflect firms’ recent financial performance 
rather than disclosure quality. Lang and Lundholm (1993) provide support for this idea. 
They show that there is a positive relation between a firm's AIMR ratings and its earnings 
and stock price performance. They argue that their results are due to firms' tendencies to 
provide more complete disclosures when they are doing well. In other words, Lang and 
Lundholm (1993) suggest that firm performance affects a firm's actual disclosures, which, 
in turn, affect AIMR ratings. My model suggests a simpler explanation. AIMR ratings may 
be a direct result of recent firm performance, independent of any actual disclosure differ- 
ences across firms. If this is true, then researchers should use caution when using AIMR 
ratings to measure disclosure quality. 

Third, the results may inform the literature on why firms provide earnings warnings. 
Studies show that firms that provide warnings about negative earnings news are penalized 
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with larger immediate stock price reactions than firms that do not provide such warnings 
(Kasznik and Lev 1995; Libby and Tan 1999). Despite this apparent penalty for warning 
investors about negative news, a number of firms continue to provide negative news earnings 
warnings (Skinner 1994). One proposed explanation for this phenomenon is that the im- 
mediate valuation benefits associated with failing to warn are outweighed by the long-term 
credibility costs of failing to warn (Kasznik and Lev 1995). However, the present study 
shows that managers who fail to warn about negative news do not take a long-term credi- 
bility hit relative to managers who do warn. If managers are aware of the short-term nature 
of any credibility effects resulting from nonforthcoming disclosures, then long-term credi- 
bility concerns cannot explain negative news earnings warnings. 

The psychological forces underlying the study's predictions are likely to be common 
across a broad range of people. Specifically, the study's short-term credibility findings are 
expected to generalize to all investors who understand managers' disclosure incentives. In 
other words, I would expect a similar pattern of short-term results for more experienced 
investors such as financial analysts because these investors have a highly developed under- 
standing of managers' disclosure incentives. However, highly inexperienced investors may 
not reward managers for forthcoming disclosure if they do not understand that managers 
sometimes have incentives to withhold information. 

The study's long-term credibility findings should generalize to both more- and less- 
experienced investors, as research in psychology shows that the role of affect in memory 
is hard-wired (Damasio 1994). In other words, all investors who feel good (bad) when the 
company reports positive (negative) unexpected earnings news should show a similar pattern 
of long-term results. Nevertheless, it is possible that investors who are more experienced 
than my M.B.A. participants develop coping mechanisms that help them avoid the judgment 
errors implied by my long-term credibility findings. For example, more sophisticated in- 
vestors may develop better memory structures that allow them to retain their cognitive 
reactions for longer periods or review disclosures frequently so they reside permanently in 
a "short-term" world. If true, then the influence of affect on more sophisticated investors’ 
credibility judgments may be less than that observed in my study. However, research in 
other domains suggests that even the most sophisticated investors are unlikely to escape 
the pull of affect-based reasoning. Brown and Perry (1994) show that Fortune magazine’s 
annual list of most admired companies, which is based on a survey of 8,000 executives, 
directors, and market analysts, is unduly influenced by a firm’s recent financial performance. 
Consistent with my theoretical framework and results, they find that a firm’s recent financial 
performance creates a “halo” that colors sophisticated investors’ judgments of the firm’s 
other attributes (e.g., social responsibility, human resource function, and management 
quality). 

The present study manipulates management’s forthcomingness by varying the timeli- 
ness of management’s disclosures (i.e., whether management provides a warning about 
unexpected earnings news). However, other factors, such as disclosure completeness and 
disclosure accuracy, may also affect the perceived forthcomingness of a management dis- 
closure. Identifying these factors, and determining whether and how they interact, is a 
potentially rich avenue for future research. Likewise, future researchers may wish to con- 
sider whether investors respond to patterns in a firm’s disclosures. For example, do investors 
regard failures to provide earnings warnings to be less diagnostic of a firm’s forthcoming- 
ness when the firm rarely provides such warnings? 

Finally, future researchers may wish to examine how the credibility effects of disclosure 
decisions vary when management has a well-developed reporting reputation. My study 
focuses on the development of investors’ beliefs about management. That 1s, I examine how 
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management's disclosure decisions affect its reporting credibility when investors do not 
have strongly held beliefs about management. The credibility effects identified here may 
differ when investors have strong beliefs about management. For example, investors who 
strongly believe that management is credible may interpret a lack of forthcomingness more 
charitably than investors who harbor suspicions about management's credibility (Ross and 
Lepper 1980; Koehler 1993). Future investigations of these issues will provide a more 
complete understanding of the determinants of reporting credibility. 





APPENDIX 
Management Reporting Credibility Questions 


l. I believe that Dentex management is very competent at providing financial 
disclosures. . 

2. Ibelieve that Dentex management has little knowledge of the factors involved in 
providing useful disclosures. 

3. I believe that few people are as qualified as Dentex management to provide 

useful financial disclosures about Dentex. 

I believe that Dentex management is very trustworthy. 

I believe that Dentex management is very honest. 

I believe that Dentex management may not be truthful in their financial 

disclosures. 


ON uo 


Participants evaluated each statement on a seven-point scale with endpoints labeled 1 — Strongly Disagree and 7 
= Strongly Agree. Participants evaluated each statement twice—once before viewing the experimental 
manipulations (Pre-test) and once after viewing the experimental manipulations (Post-test). To minimize response 
bias, high credibility was sometimes indicated by agreement (i.e., high ratings) and sometimes indicated by 
disagreement (i.e., low ratings); before analysis, responses were recoded so that aigher responses always 
indicated greater perceived credibility. After this recoding, responses were summed to form a composite measure 
of management's reporting credibility. Changes in management's reporting credibility were computed by 
subtracting participants’ pre-test composite credibility assessments from their post-test composite credibility 
assessments. 
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The following table contains information about turnaround time for manuscripts (including revisions) 
on which editorial decisions were made in the 12-month period ended August 31, 2004. Turnaround 
time is the number of days between the date that the manuscript was received and the date of the 
editor's letter to the author(s): 


Ed 


Number of Cumulative Cumulative 
Manuscripts Percent Number Percent 
0 = Days = 30 65 16.46 65 16.46 
31 = Days = 60 145 36.71 210 53.16 
61 = Days = 90 118 29.87 328 83.04 
91 < Days s 120 56 14.18 384 97.21 
121 = Days H 2.78 395 100.00 

Total 395 100.00 


The mean review time was 60.68 days; the median review time was 59 days. 


New Submissions by calendar year (January 1 to December 31) 


1992 241 
1993 234 
1994 231 
1995 195 
1996 230 
1997 I95 
1998 196 
1999 239 
2000 260 
2001 260* 
2002 324 
2003 327 


2004 Aug 31 210 (8 mths) 


* 2001—in addition to the 260 regular submissions, 68 MSs were submitted to the TAR Quality of 
Earnings Conference. 


The acceptance rate (defined as number of MSs accepted divided by the number of new submissions) 
over the past 5 years is between 12-15% per year. 
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An Abstract of about 100 words should be presented on a separate page immediately preceding the text. 
The Abstract should concisely inform the reader of the manuscript's topic, its methods, and its findings. 
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methodology, and findings. Both the Abstract and the Introduction should be relatively nontechnical, yet 
clear enough for an informed reader to understand the manuscript's contribution. The manuscript's title, but 
neither the author's name nor other identification designations, should appear on the Abstract page. 


TABLES AND FIGURES 
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5. Revisions must be submitted within 12 months from request, otherwise they will be considered new 
submissions. 


COMMENTS 


Comments on articles previously published in The Accounting Review will be reviewed (anonymously) by 
two reviewers in sequence. The first reviewer will be the author of the original article being subjected to 
critique. If substance permits, a suitably revised comment will be sent to a second reviewer to determine its 
publishability in The Accounting Review. If a comment is accepted for publication, the original author will 
be invited to reply. All other editorial requirements, as enumerated above, also apply to proposed comments. 


POLICY ON REPRODUCTION 


An objective of The Accounting Review is to promote the wide dissemination of the results of systematic 
scholarly inquiries into the broad field of accounting. 

Permission is hereby granted to reproduce any of the contents of the Review for use in courses of 
instruction, as long as the source and American Accounting Association copyright are indicated in any such 
reproductions. 

Written application must be made to the American Accounting Association, 5717 Bessie Drive, Sarasota, 
FL 34233-2399, for permission to reproduce any of the contents of the Review for use in other than courses 
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of instruction—e.g., inclusion in books of readings or in any other publications intended for general distri- 
bution. In consideration for the grant of permission by the Review in such instances, the applicant must 
notify the author(s) in writing of the intended use to be made of each reproduction. Normally, the Review 
will not assess a charge for the waiver of copyright. 

Except where otherwise noted in articles, the copyright interest has been transferred to the American 
Accounting Association. Where the author(s) has (have) not transferred the copyright to the Association, 
applicants must seek permission to reproduce (for all purposes) directly from the author(s). 


POLICY ON DATA AVAILABILITY 


The following policy has been adopted by the Executive Committee in its April 1989 meeting. “An 
objective of (The Accounting Review, Accounting Horizons, Issues in Accounting Education) is to provide 
the widest possible dissemination of knowledge based on systematic scholarly inquiries into accounting as 
a field of professional research, and educational activity. As part of this process, authors are encouraged to 
make their data available for use by others in extending or replicating results reported in their articles. 
Authors of articles which report data dependent results should footnote the status of data availability and, 
when pertinent, this should be accompanied by information on how the data may be obtained." 
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APRIL 2005 PLACEMENT ADS 


The deadline for information to be included in the “Placement Information” section 
is two months prior to the date of publication. The Review is published in January, 
April, July, and October. Placement advertising and replies should be sent to the 
American Accounting Association, 5717 Bessie Drive, Sarasota, FL 34233-2399, 
ATTENTION: ADVERTISING DEPT. All replies to box numbers should be on sep- 
arate sheets to facilitate forwarding. 


201 KENT STATE UNIVERSITY, Department of Accounting, invites applications for an Associate 
Professor with teaching emphasis in Managerial Accounting beginning in Fall 2005. The suc- 
cessful candidate will also serve as a KPMG Faculty Fellow. Candidates must have a high-quality, 
established research record. A Ph.D. with a major in Accounting or an appropriate related field 
is required. Professional certification such as CPA or CMA is highly desirable. All candidates 
must be able to demonstrate evidence of discovery/integration research and of effective under- 
graduate and graduate teaching. Research will be expected in high-quality, academic refereed 
journals for.reappointment, tenure, and promotion. Participation in teaching doctoral students and 
serving on and/or supervising their dissertations will be an important part of the faculty member's 
responsibilities. Review of applications will begin immediately and continue until the position is 
filled. If you are interested in this position, please contact and/or mail a copy of your curriculum 
vitae to Dr. Pervaiz Alam, Chair, Faculty Search Committee, Department of Accounting, 576 
BSA, Kent State University, Kent, OH 44242; Phone: (330) 672-1121; Fax: (330) 672-2548; 
Email: palam Q bsa3.kent.edu. 


202 UNIVERSITY OF NOTRE DAME, Department of Accountancy, invites applications for position 
openings at the Assistant and Associate Professor levels. All research areas will be considered. 
Applicants for Assistant Professor positions should expect to complete doctoral degree require- 
ments before the start of the contract period. All candidates should have a strong commitment to 
scholarly research and excellence in teaching. Salary is very competitive and research support is 
excellent. Please send curriculum vitae and an example of scholarly work to Professor Tom 
Schaefer, Chair, Department of Accountancy, Room 102D, Mendoza College of Business Ad- 
ministration, University of Notre Dame, Notre Dame, IN 46556-5646. The University of Notre 
Dame is an Equal Opportunity, Affirmative Action Employer. 


203 INSTITUTO DE EMPRESA, Madrid (Spain), is seeking faculty for tenure-track positions in 
Accounting beginning Fall 2005. We seek candidates with a Ph.D. or D.B.A. degree in Accounting 
from an accredited university who have substantial experience in teaching and research. Candi- 
dates must be fluent in English. Experience with the case-teaching method and fluency in Spanish 
is not required but highly valued. Please submit an up-to-date curriculum vitae, three reference 
letters, publication samples, and a statement describing teaching and research interest to Patricia 
Garcia, Email: Patricia.Garcia @ie.edu. 
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204 GRAND VALLEY STATE UNIVERSITY, Seidman College of Business, invites applications for 
Assistant/ Associate Professor in Accounting. Tenure-track position is available Fall 2005 and will 
remain open until filled. Need Ph.D. or D.B.A. (by December 2005) in Accounting or related 
field. Professional certification is desirable. Cost/ Managerial Accounting and Financial Account- 
ing are primary needs; other fields may be considered. Applicant must demonstrate the ability to 
teach effectively and conduct research. The Department and College are both accredited by the 
AACSB. Salary is competitive. For consideration, send resume, three references, sample of teach- 
ing evaluations, and copies of publications or representative research to Ms. Kathy Goralski, 
Accounting Search Committee, Department of Accounting and Taxation, Grand Valley State Uni- 
versity, 401 West Fulton Street, Grand Rapids, MI 49504, or send email to goralskk @gvsu.edu. 
Position is subject to availability of funding. Grand Valley State University is an EO/AA 
Institution. 


205 UNIVERSITY OF NORTHERN COLORADO invites applications for a tenure-track Assistant/ 
Associate level position teaching Financial Accounting and at least one other area with a pref- 
erence for Governmental Accounting. Candidates must have completed all requirements for doc- 
toral degree in Accounting by the position starting date of Fall 2005. This position will be filled 
as soon as an acceptable candidate is found. Candidates must be committed to excellence in 
teaching; professional and academic research; and university and community service. The Monfort 
College of Business undergraduate-only Accounting program is AACSB-accredited. The College 
is a 2004 recipient of the Malcolm Baldrige National Quality Award. Candidates should submit 
a cover letter, curriculum vitae, evidence of teaching excellence, and three references to Dr. Terri 
Gutierrez, Accounting Department, Kenneth W. Monfort College of Business, Campus Box 128, 
Greeley, CO 80639. UNC, located 50 miles north of Denver, is an AA/EO Employer. 


206 UNIVERSITY OF CALIFORNIA, LOS ANGELES, The John E. Anderson Graduate School of 
Management, invites applications for positions in Accounting at both the junior and senior levels 
beginning July 1, 2005. Applicants must have a Ph.D. or be a Ph.D. candidate and exhibit out- 
standing research capabilities. Curriculum vitae and working papers should be sent to Professor 
Brett Trueman, The John E. Anderson Graduate School of Management, Box 951481, UCLA, 
Los Angeles, CA 90095-1481. Under federal law, the University of California may employ only 
individuals who are legally authorized to work in the United States as established by providing 
documents specified in the Immigration Reform and Control Act of 1986. UCLA 1s an Affirmative 


Action, Equal Opportunity Employer. 


207 MANHATTAN COLLEGE, School of Business, an AACSB-accredited school, seeks applicants 
for a tenure-track position as an Assistant or Associate Professor of Accounting, beginning in 
August 2005. Candidates must have a Ph.D. or D.B.A. in Accounting as well as evidence of 
teaching effectiveness and research competence. Responsibilities include teaching undergraduate 
courses in Accounting, engaging in scholarly research, and participating in campus activities. The 
Department seeks a candidate willing to teach Taxation, but all qualified candidates are encouraged 
to apply. Manhattan College is an independent Catholic college founded by the De La Salle 
Christian Brothers. The College is in the Riverdale section of New ‘York City. Recruiting begins 
immediately and continues until the position is filled. Salary and rank will be based on qualifi- 
cations. Please send a letter of application, curriculum vitae, and li3t of three references to Dr. 
Mary Michel, Chairperson of Accounting, Law and Computer Information Systems, School of 
Business, Manhattan College, Riverdale, NY 10471 (or email: mary.michel € manhattan.edu). 
Manhattan College is an Affirmative Action/Equal Opportunity Employer. : 
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208 FERRIS STATE UNIVERSITY invites applications for a tenure-track Accounting position at the 
rank of Assistant Professor beginning August 2005. Applicants from all areas of Accounting are 
encouraged to apply. À doctorate in Accounting or Ph.D. in related field from an accredited 
institution with Accounting certification is required (ABDs will be considered). Candidates should 
possess an excellent record in teaching, scholarship, and service, as well as a demonstrated ca- 
pacity to incorporate current teaching technology. Business experience in the field, professional 
certification, and relevant teaching experience in Financial Accounting and/or Accounting Sys- 
tems are preferred. Please submit a letter of interest, curriculum vitae, names/contact information 
of three references, and sample copies of student assessment of instruction to JOB CODE AAA- 
2620, Dr. Mohammed A. El-Saidi, AFES Dept. Head, Ferris State University, 420 Oak St., PRK- 
150, Big Rapids, MI 49307. For more information about Ferris, please visit our website at http: 
//www-ferris.edu/. Ferris State University is an Equal Opportunity/ Affirmative Action Employer. 


209 UNIVERSITY OF ARKANSAS invites applications and nominations for the position of Dean of 
the Sam M. Walton College of Business. The Dean is the chief academic and administrative 
officer of the College, and reports to the Provost for Academic Affairs. Salary is competitive and 
commensurate with experience and qualifications. For the position description and information 
about the University, please visit: Applicants are requested to submit a letter of application, 
curriculum vitae, and five professional references to Dr. G. David Gearhart, Chair of the Search 
Committee, Vice Chancellor for University Advancemént, 416 Administration Building, Univer- 
sity of Arkansas, Fayetteville, AR 72701. The selection process will begin March 1, 2005, and 
continue until the position is filled. Under the state's Freedom of Information Act, after selected 
candidates are invited to campus, their names are available to the public upon request. The Uni- 
versity of Arkansas is an Equal Opportunity, Affirmative Action Institution. 


210 UNITED ARAB EMIRATES UNIVERSITY seeks qualified faculty at the levels of Instructor, 
Assistant Professor, Associate Professor, and Professor for academic year 2005—2006 (which starts 
in September 2005). Instructors are expected to be holders of a Master's degree; other ranks must 
hold the terminal degree in the appropriate discipline. The principal responsibility of each faculty 
is undergraduate instruction. Active research is expected as well. Excellent research facilities and 
a variety of support are provided. Opportunities are available to teach in the following areas of 
Accounting: Auditing, Accounting Information Systems, Cost, and Managerial Accounting. Sal- 
aries and benefits include: tax-exempt compensation package; housing; furniture and utilities al- 
lowances; employer-paid comprehensive health insurance; children's education allowance; annual 
airline tickets for employee and family; end-of-service gratuity; generous vacation package. If 
you are interested in joining a professional team dedicated to ensuring growth and development, 
please visit: http: //academics.uaeu.ac.ae/jobs/. 


211 UNIVERSITY OF MINNESOTA, Carlson School of Management, invites applications for tenure- 
track faculty positions at all ranks in Accounting. We offer competitive salary and benefits in a 
stimulating research environment. All applicants must possess a doctoral degree (or be close to 
completion), and have a demonstrated commitment to Accounting research and teaching com- 
mensurate with rank. Applications will be accepted until positions are filled. Interested candidates 
should send applications, including curriculum vitae, research papers, and at least two references 
from established scholars to Chair, Search Committee, Department of Accounting, 3-122 Carlson 
School of Management, University of Minnesota, 321 19th Avenue South, Minneapolis, MN 
55455. The University of Minnesota is an Equal Opportunity Educator and Employer. 
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212 SETON HALL UNIVERSITY, Department of Accounting and Taxation, invites applications for 
a tenure-track position as Assistant Professor of Accounting beginning September 2005. Appli- 
cants must have a Ph.D. in Accounting and should demonstrate evidence of a strong commitment 
to excellence in teaching and research scholarship. Teaching responsibilities will be primarily in 
Financial and Managerial Accounting. Salary is competitive. Please send curriculum vitae to 
Professor David S. Gelb, Chair, Department of Accounting and Taxation, Stillman School of 
Business, Seton Hall University, 400 South Orange Avenue, South Orange, New Jersey 07079- 
2692. Applications will be accepted until a successful candidate is found. 


213 BUENA VISTA UNIVERSITY, Harold Walter Siebens School of Business, invites applications 
for tenure-track position at the Assistant Professor level to begin Fall 2005. The successful can- 
didate should have an earned doctorate in Accounting, although ABD or appropriate Master's 
with certification will be considered. Candidates must be dedicated to excellence in teaching. 
university, and community service and scholarly activity. Teaching in the areas of Financial, 
Managerial, Auditing, Tax, and Certificate Preparation is expected. BVU is nestled beside a beau- 
tiful 3,000-acre lake in the richly diverse northwest Iowa community of Storm Lake. Interested 
applicants should submit a cover letter, curriculum vitae, names ard telephone numbers of four 
references, unofficial transcripts, and evidence of teaching excellence to Dr. Mary Gill, Associate 
Dean of Faculty, Buena Vista University, 610 West 4th Street, Storm Lake, IA 50588. See com- 
plete description at: http: // www.bvu.edu/departments/businessservices/hr/jobopenings.asp. Re- 
view of applications will begin immediately and continue until the position is filled. Women and 
minorities are strongly urged to apply. ADA/EOE. 
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G Guo Tai An Information Technology Co., Ltd. 


China Stock Market, Bond Market, Banking and Economic Databases for Research 


eee ae ee ee ss of Wharton Research Data Service (WRDS) (please 
1 ails : Asda! Bin. i 





GTA’s databases are designed to be compatible with international database standards and have been verified by more 
than 100 professors and experts. 


Since 2000, GTA has been serving over 2,000 international scholars and researchers from more than 120 highly reputable 
educational and professional institutions, such as Prumerica, Merrill Lynch, Barra China Securities Regulatory 
Commission, Shanghai Stock Exchange, Yale University, New York University, Wharton School, University of 
Wisconsin — Madison, University of California at Irvin, University of Reading(UK), Universuty of Pittsburgh, the 
National University of Singapore, University of Waikato(NZ), Sydney University, University of Hong Kong, Chinese 
University of Hong Kong, Hong Kong University of Science and Technology, Tsinghua University, Peking University, 
Fudan University, Shanghai Jiaotong University, Xiamen University, Shanghai University of Finance and Economics, 
Xi’an Jiaotong University, and Nanjing University. For details, please visit www,chinagtait. com. 


CSMAR major research databases (GTA provides also contracted/customized research service) 


China Macro-economy 

* China Macro-economy Database 

* China Industry Research Database 

China Stock Market 

* CSMAR China Stock Market Trading Database 

* China Stock Trade and Quote Research Database 

* CSMAR China Stock Market Financial Database—Annual/ Interim/ Quarterly Report 
* China Listed Companies' Financial Ratio Research Database 

* China Initial Public Offering Research Database 

* China Listed Corporate Governance Research Database 

* China Listed Companies Merger & Acquisition, Asset Restructuring Research Database 
* China Stock Market Information Disclosure System 

* China Stock Market Related Party Transaction Research Database 
China Fund Market. 

* China Securities Investment Fund Research Database 


* China Close-end Fund Trade and Quote Research Database 
China Bond Market 

* China Listed Bond Research Database 

* China Listed Convertible Bond Research Database 

* China Bond Trade and Quote Research Database 

China Futures Market 

* China Futures Research Database 

Hong Kong Securities Market 

* Hong Kong GEM Stock Market Trading Database 

* Hong Kong GEM IPO Research Database 


Selected Products for securities analysis purposes: 


* Daily updating system of China Stock Market & Accounting Research Database (CSMAR): Including financial 
and trading data updated on a daily basis. 


* China ValueLine (CVL) system 

* China Securities Information and Analysis System (CSIAS) 

* China Securities Excel System (CSXcel) 

For free sample databases, mannals, brochures, and more information, please contact us or visit our website: 


GTA Information Technology Co., Ltd. http /orerecchinagtgit com. 
Hang Kong. Chine: 85227812531 Te) 8522781577) Shenzhen China: 86-755-83940235 (Tel) 86-755-83940070 (Fax) 
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